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1 

CHIMERIC GENES SUITABLE FOR 
5 EXPRESSION IN PLANT CELLS 

Technical Field 

This invention is in the fields of genetic 
10 engineering, plant biology, and bacteriology. 

Background Art 

In the past decade, the science of genetic 

15 engineering has developed rapidly. A variety of 

processes are known for inserting a heterologous gene 
into bacteria, whereby the bacteria become capable of 
efficient expression of the inserted genes. Such 
processes normally involve the use of plasmids which 

20 may be cleaved at one or more selected cleavage sites 
by restriction endonucleases , discussed below. 
Typically, a gene of interest is obtained by cleaving 
one piece of DNA and the resulting DNA fragment is 
mixed with a fragment obtained by cleaving a vector 

25 such as a plasmid. The different strands of DNA are 
then connected ("ligated") to each other to form a 
reconstituted plasmid. See, for example, U.S. Patents 
4,237,224 (Cohen and Boyer, 1980); 4,264,731 (Shine, 
1981); 4,273,875 (Manis, 1981); 4,322,499 (Baxter et 

30 al, 1982), and 4,336,336 (Silhavy et al, 1982). A 
variety of other reference works are also available. 
Some of these works describe the natural processes 
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whereby DNA is transcribed into messenger RNA (mRNA) 
and mRNA is translated into protein; see, e.g., 
Stryer, 1981 (note: all references cited herein, other 
than patents, are listed with citations after the 
5 Examples); Lehninger, 1975. Other works describe 
methods and products of genetic manipulation; see, 
e.g., Maniatis et al, 1982; Setlow and 
Hollaender,1979. 

10 Most of the genetic engineering work 

performed to date involves the insertion of genes into 
various types of cells, primarily bacteria such as 
E. coli , various other types of microorganisms such as 
yeast, and mammalian cells. However, many of the 

15 technigues and substances used for genetic engineering 
of animal cells and microorganisms are not directly 
applicable to genetic engineering" involving plants . 

As used herein, the term "plant" refers to a 
20 multicellular differentiated organism that is capable 
of photosynthesis, such as angiosperms and 
multicellular algae. This does not include 
microorganisms, such as bacteria, yeast, and fungi. 
However, the term "plant cells" includes any cell 
25 derived from a plant; this includes undifferentiated 
tissue such as callus or crown gall tumor, as well as 
plant seeds, propagules, pollen, and plant embryos. 

A variety of plant genes have been 
30 isolated, some of which have been published and/or are 
publicly available. Such genes include the soybean 
actin gene (Shah et al 1982), corn zein (Pederson et 
al, 1982) soybean leghemoglobin (Hyldig-Nielsen et al, 
1982), and soybean storage proteins (Fischer and 
35 Goldberg, 1982). 
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The Regions of a Gene 

The expression of a gene involves the 
creation of a polypeptide which is coded for by the 
5 gene. This process involves at least two steps: part 
of the gene is transcribed to form messenger RNA, and 
part of the mRNA is translated into a polypeptide. Although 
the processes of transcription and translation are not 
fully understood, it is believed that the 

10 transcription of a DNA sequence into mRNA is 

controlled by several regions of DNA. Each region is 
a series of bases (i.e., a series of nucleotide 
residues comprising adenosine (A), thymidine (T), 
cytidine (C), and guanidine (G)) which are in a 

15 desired sequence. Regions which are usually present 
in a eucaryotic gene are shown on Figure 1. These 
regions have been assigned names for use herein, and 
are briefly discussed below. It should be noted that 
a variety of terms are used in the literature, which 

20 describes these regions in much more detail. 

An association region 2 causes RNA 
polymerase to associate with the segment of DNA. 
Transcription does not occur at association region 2; 
25 instead, the RNA polymerase normally travels along an 
intervening region 4 for an appropriate distance, such 
as about 100-300 bases, after it is activated by 
association region 2. 

30 A transcription initiation sequence 6 

directs the RNA polymerase to begin synthesis of mRNA. 
After it recognizes the appropriate signal, the RNA 
polymerase is believed to begin the synthesis of mRNA 
an appropriate distance, such as about 20 to about 30 

35 bases, beyond the transcription initiation sequence 6. 
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This is represented in Figure 1 by intervening region 
8. 

The foregoing sequences are referred to 
5 collectively as the promoter region of the gene. 

The next sequence of DNA is transcribed by 
RNA polymerase into messenger RNA which is not 
translated into protein. In general, the 5 1 end of a 

10 strand of mRNA attaches to a ribosome. In bacterial 

cells, this attachment is facilitated by a sequence of 
bases called a "ribosome binding site" (RBS). 
However, in eucaryotic cells, no such RBS sequence is 
known to exist- Regardless of whether an RBS exists 

15 in a strand of mRNA, the mRNA moves through the 

ribosome until a "start codon" is encountered. The 
start codon is usually the series" of three bases, AUG; 
rarely, the codon GUG may cause the initiation of 
translation. The non- translated portion of mRNA 

20 located between the 5' end of the mRNA and the start 
codon is referred to as the 5' non-translated region 
10 of the mRNA. The corresponding sequence in the DNA 
is also referred to herein as 5' non- translated region 
12. The specific series of bases in this sequence is 

25 not believed to be of great importance to the 

expression of the gene; however, the presence of a 
premature start codon might affect the translation of 
the mRNA (see Kozak, 1978). 

30 A promoter sequence may be significantly 

more complex than described above; for example, 
certain promoters present in bacteria contain 
regulatory sequences that are often referred to as 
"operators." Such complex promoters may contain one 

35 or more sequences which are involved in induction or 
repression of the gene. One example is the lac 
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operon, which normally does not promote transcription 
of certain lactose-utilizing enzymes unless lactose is 
present in the cell- Another example is the trp 
operator, which does not promote transcription or 
5 translation of certain tryptophan-creating enzymes if 
an excess of tryptophan is present in the cellJ See, 
e.g., Miller and Reznikoff, 1982. 

The next sequence of bases is usually 

10 called the coding sequence or the structural sequence 
14 (in the DNA molecule) or 16 (in the mRNA molecule). 
As mentioned above, the translation of a polypeptide 
begins when the mRNA start codon, usually AUG, reaches 
the translation mechanism in the ribosome. The start 

15 codon directs the ribosome to begin connecting a 

series of amino acids to each other by peptide bonds 
to form a polypeptide, starting with methionine, which 
always forms the amino terminal end of the polypeptide 
(the methionine residue may be subsequently removed 

20 from the polypeptide by other enzymes). The bases 

which follow the AUG start codon are divided into sets 
of 3, each of which is a codon. The "reading frame", 
which specifies how the bases are grouped together 
into sets of 3, is determined by the start codon. 

25 Each codon codes for the addition of a specific amino 
acid to the polypeptide being formed. The entire 
genetic code (there are 64 different codons) has been 
solved; see, e.g., Lehninger, supra, at p. 962. For 
example, CUA is the codon for the amino acid leucine; 

30 GGU specifies glycine, and UGU specifies cysteine. 



Three of the codons (UAA, UAG, and UGA) are 
"stop" codons; when a stop codon reaches the 
translation mechanism of a ribosome, the polypeptide 
35 that was being formed disengages from the ribosome, 
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and the last preceding amino acid residue becomes the 
carboxy terminal end of the polypeptide.. 

The region of raRNA which is located on the 
5 3' side of a stop codon in a monocistronic gene is 
referred to herein as 3' non-translated region 18. 
This region 18 is believed to be involved in the 
processing, stability, and/or transport of the mRNA 
after it is transcribed. This region 18 is also 
10 believed to contain a sequence of bases, 

poly-adenylation signal 20, which is recognized by an 
enzyme in the cell. This enzyme adds a substantial 
number of adenosine residues to the mRNA molecule, to 
form poly-A tail 22. 

15 

The DNA molecule has a 3' non- translated 
region 24 and a poly-adenylation signal 26, which code 
for the corresponding mRNA region 18 and signal 20. 
However, the DNA molecule does not have a poly-A tail. 
20 Poly-adenylation signals 20 (mRNA) and 26 (DNA) are 
represented in the figures by a heavy dot. 

Gene-Host Incompatibility 

The same genetic code is utilized by all 
25 living organisms on Earth. Plants, animals, and 

microorganisms all utilize the same correspondence 
between codons and amino acids. However, the genetic 
code applies only to the structural sequence of a 
gene, i.e., the segment of mRNA bounded by one start 
30 codon and one stop codon which codes for the 
translation of mRNA into polypeptides. 

However, a gene which performs efficiently 
in one type of cell may not perform at all in a 
35 different type of cell. For example, a gene which is 
expressed in E. coli may be transferred into a 




WO 84/02913 



PCT/US84/00048 



-7- 

different type of bacterial cell, a fungus, or a 
yeast. However, the gene might not be expressed in 
the new host cell. There are numerous reasons why an 
intact gene which is expressed in one type of cell 
5 might not be expressed in a different type of cell. 

See, e.g., Sakaguchi and Okanishi, 1981. Such reasons 
include : 

1. the gene might not be replicated or 

10 stably inherited by the progeny of the new host cell. 

2. the gene might be broken apart by 
restriction endonucleases or other enzymes in the new 
host cell. 

15 

3 . the promoter region of the gene might not 
be recognized by the RNA polymerases in the new host 
cell. 

20 4. one or more regions of the gene might be 

bound by a repressor protein or other molecule in the 
new host cell, because of a DNA region which resembles 
an operator or other regulatory sequence of the host's 
DNA. For example, the lac operon includes a 

25 polypeptide which binds to a particular sequence of 

bases next to the lac promoter unless the polypeptide 
is itself inactivated by lactose. See, e.g., Miller 
and Reznikoff, 1982. 

30 5. one or more regions of the gene might be 

deleted, reorganized, or relocated to a different part 
of the host's genome. For example, numerous 
procaryotic cells are known to contain enzymes which 
promote genetic recombination; (such as the rec 

35 proteins in E. coli ; see, e.g., Shibata et al, 1979) 
and transposition (see, e.g., The 45th Cold Spring 
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Harbor Symposium on Quantitative Biology, 1981). In 
addition, naturally-occurring genetic modification can 
be enhanced by regions of homology between different 
strands of DNA; see, e.g., Radding, 1978. 

5 

6. mRNA transcribed from the gene may suffer 
from a variety of problems. For example, it might be 
degraded before it reaches the ribosome, or it might 
not be poly-adenylated or transported to the ribosome, 

10 or it might not interact properly with the ribosome, 
or it might contain an essential sequence which is 
deleted by RNA processing enzymes. 

7. the polypeptide which is created by 
15 translation of the mRNA coded for by the gene may 

suffer from a variety of problems. For example, the 
polypeptide may have a toxic effect on the cell, or it 
may be glycosylated or converted into an altered 
polypeptide, or it may be cleaved into shorter 
20 polypeptides or amino acids, or it may be sequestered 
within an intracellular compartment where it is not 
functional . 

In general, the likelihood of a foreign gene 
25 being expressed in a cell tends to be lower if the new 
host cell is substantially different from the natural 
host cell. For example, a gene from a certain species 
of bacteria is likely to be expressed by other species 
of bacteria within the same genus. The gene is less 
30 likely to be expressed by bacteria of a different 
genus, and even less likely to be expressed by 
non-bacterial microorganisms such as yeast, fungus, or 
algae. It is very unlikely that a gene from a cell of 
one kingdom (the three kingdoms are plants, animals, 
35 and "protista" (microorganisms)) could be expressed in 
cells from either other kingdom. 
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These and other problems have, until now, 
thwarted efforts to obtain expression of foreign genes 
into plant cells. For example, several research teams 
have reported the insertion of foreign DNA into plant 
5 cells; see, e.g., Lurguin, 1979; Krens et al, 1982; 

Davey et al, 1980. At least threeteams of researchers 
have reported the insertion of entire genes into plant 
cells. By use of radioactive DNA probes, these 
researchers have reported that the foreign genes (or 

10 at least portions thereof) were stably inherited by 

the descendants of the plant cells. See Hernalsteens 
et al, 1980; Garfinkel et al, 1981; Matzke and 
Chilton, 1981. However, there was no reported 
evidence that the foreign genes were expressed in the 

15 plant cells. 

Several natural exceptions to the gene-host 
incompatibility barriers have been discovered. For 
example, several E. coli genes can be expressed in 
20 certain types of yeast cells, and vice-versa. See 
Beggs, 1978; Struhl et al, 1979. 

In addition, certain types of bacterial 
cells, including Agrobacterium tumefaciens and A^ 

25 rhizogenes , are capable of infecting various types of 
plant cells, causing plant diseases such as crown gall 
tumor and hairy root disease. These Agrobacterium 
cells carry plasmids, designated as Ti plasmids and Ri 
plasmids, which carry genes which are expressed in 

30 plant cells. Certain of these genes code for enzymes 
which create substances called "opines," such as 
octopine, nopaline, and agropine. Opines are utilized 
by the bacteria cells as sources of carbon, nitrogen, 
and energy. See, e.g., Petit and Tempe, 1978. The 

35 opine genes are believed to be inactive while in the 



WO 84/02913 



PCT/US84/00048 



-10- 

bacterial cells; these genes are expressed only after 
they enter the plant cells. 

In addition, a variety of man-made efforts 
have been reported to overcome one or more of the 
gene-host incompatibility barriers. For example, it 
has been reported that a mammalian polypeptide which 
is normally degraded within a bacterial host can be 
protected from degradation by coupling the mammalian 
polypeptide to a bacterial polypeptide that normally 
exists in the host cell. This creates a "fusion 
protein;" see, e.g., Itakura et al, 1977. As another 
example, in order to avoid cleavage of an inserted 
gene by endonucleases in the host cell, it is possible 
to either (1) insert the gene into host cells which 
are deficient in one or more endonucleases, or (2) 
duplicate the gene in cells which cause the gene to be 
methylated. See, e.g., Maniatis et al, 1981. 

In addition, various efforts to overcome 
gene-host incompatibility barriers involve chimeric 
genes. For example, a structural sequence which codes 
for a mammalian polypeptide, such as insulin, 
interferon, or growth hormone, may be coupled to 
regulatory sequences from a bacterial gene. The 
resulting chimeric gene may be inserted into bacterial 
cells, where it will express the mammalian 
polypeptide. See, e.g., Guarente et al, 1980. 
Alternately, structural sequences from several 
bacterial genes have been coupled to regulatory 
sequences from viruses which are capable of infecting 
mammalian cells. The resulting chimeric genes were 
inserted into mammalian cells, where they reportedly 
expressed the bacterial polypeptide. See, e.g., 
Southern and Berg, 1982; Colbere-Garapin et al, 1982. 
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30 



Restriction Endonucleases 

In general, an endonuclease is an enzyme 
which is capable of breaking DNA into segments of DNA. 
An endon uclease is capable of attaching to a strand of 
DNA somewhere in the middle of the strand, and 
breaking it. By comparison, an exo nuclease removes 
nucleotides from the end of a strand of DNA. All of 
the endonucleases discussed herein are capable of 
breaking double-stranded DNA into segments. This may 
require the breakage of two types of bonds: (1) 
covalent bonds between phosphate groups and 
deoxyribose residues, and (2) hydrogen bonds (A-T and 
C-G) which hold the two strands of DNA to each other. 

A "restriction endonuclease" (hereafter 
referred to as an endonuclease) breaks a segment of 
DNA at a precise sequence of bases. For example, 
EcoRI and Haelll recognize and cleave the following 
sequences : 



^ XXG _|_ AATTCXX 
YYCTTAA GYY 



GGCC 
CCGG 



In the examples cited above, the EcoRI 
cleavage created a "cohesive" end with a 5' overhang 
(i.e., the single-stranded "tail" has a 5' end rather 
35 than a 3 1 end). Cohesive ends can be useful in 
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promoting desired ligations. For example, an EcoRI 
end is more likely to anneal to another EcoRI end than 
to a Haelll end. 



each of which is capable of cleaving DNA at specific 
sequences. See, e.g., Roberts, 1982. All restriction 
endonucleases are sensitive to the sequence of bases. 
In addition, some endonucleases are sensitive to 
10 whether certain bases have been methylated. For 

example, two endonucleases, Mbol and Sau3a are capable 
of cleaving the following sequence of bases as shown: 



adenine residue is methylated (me-A) . Sau3a can 
cleave this sequence, regardless of whether either A 

20 is methylated. To some extent the methylation (and 

therefore the cleavage) of a plasmid may be controlled 
by replicating the plasmids in cells with desired 
methylation capabilities. An E. coli enzyme, DNA 
adenine methylase (dam), methylates the A residues 

25 that occur in GATC sequences. Strains of E. coli which 
do not contain the dam enzyme are designated as dam- 
cells. Cells which contain dam are designated as dam+ 
or dam cells. 

30 Several endonucleases are known which cleave 

different sequences, but which create cohesive ends 
which are fully compatible with cohesive ends created 
by other endonucleases. For example, at least five 
different endonucleases create 5' GATC overhangs, as 

35 shown in Table 1. 



5 



Over 100 different endonucleases are known, 



15 




GATCXX 



Mbol cannot cleave this sequence if the 



r ompi _ > 
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Table 1 

Endonuclease 

5 

Mbol 

Inhibited by me-A 
Sau3a 

10 Unaffected by me-A 

Bglll 
Unaffected by me-A 

15 Bell 

Inhibited by me-A 



BaraHI 

20 Unaffected by me-A 

A cohesive end created by any of the 
enconucleases listed in Table 1 will ligate 
preferentially to a cohesive end created by any of the 
25 other endonucleases . However, a ligation of, for 

example, a Bglll end with a BamHI end will create the 
following sequence: 

AGATCC 
TCTAGG 

30 This sequence cannot be cleaved by either Bgl II or 

BamHI; however, it can be cleaved by Mbol (unless 
methylated) or by Sau3a. 



Sequence 
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Another endonuclease which involves the GATC 
sequence is Pvul, which creates a 3' overhang, as 
follows : 



Another endonuclease, Clal, cleaves the 
following sequence: 



Y TAGCpAY 

If X 1 is G, or if X 2 is C, then the sequence may be 
cleaved by Mbol (unless methylated, in which case Clal 
15 is also inhibited) or Sau3a. 

Viral Promoters 

A virus is a microorganism comprising single 
or double stranded nucleic acid (DNA or RNA) contained 

20 within a protein (and possibly lipid) shell called a 
"capsid" or "coat". A virus is smaller than a cell, 
and it does not contain most of the components and 
substances necessary to conduct most biochemical 
processes. Instead, a virus infects a cell and uses 

25 the cellular processes to reproduce itself. 

The following is a simplified description of 
how a DNA-containing virus infects a cell; RNA viruses 
will be disregarded in this introduction for the sake 
of clarity. First, a virus attached to or enters a 

30 cell, normally called a "host" cell. The DNA from the 
virus (and possible the entire viral particle) enters 
the host cell where it usually operates as a plasmid 
(a loop of extra-chromosomal DNA). The viral DNA is 
transcribed into messenger RNA, which is translated 

35 into one or more polypeptides. Some of these 

polypeptides are assembled into new capsids, while 
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others act as enzymes to catalyze various biochemical 
reactions. The viral DNA is also replicated and 
assembled with the capsid polypeptides to form new 
viral particles. These viral particles may be 
5 released gradually by the host cell, or they may cause 
the host cell to lyse and release them. The released 
viral particles subsequently infect new host cells. 
For more background information on viruses see, e.g., 
Stryer, 1981 and Matthews, 1970. 

10 As used herein, the term "virus" includes 

phages and viroids, as well as replicative 
intermediates. As used herein, the phrases "viral 
nucleic acid" and "DNA or RNA derived from a virus" are 
construed broadly to include any DNA or RNA that is 
15 obtained or derived from the nucleic acid of a virus. 
For example, a DNA strand created by using a viral RNA 
strand as a template, or by chemical synthesis to 
create a known sequence of bases determined by 
analyzing viral DNA, would be regarded as viral 
20 nucleic acid. 

The host range of any virus (i.e., the 
variety of cells that a type of virus is capable of 
infecting) is limited. Some viruses are capable of 
efficient infection of only certain types of bacteria; 
other viruses can infect only plants, and may be 
limited to certain genera; some viruses can infect 
only mammalian cells. Viral infection of a cell 
requires more than mere entry of the viral DNA or RNA 
into the host cell; viral particles must be reproduced 
within the cell. Through various assays, those 
skilled in the art can readily determine whether any 
particular type of virus is capable of infecting any 
particular genus, species, or strain of cells. As 
used herein, the term "plant virus" is used to 
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designate a virus which is capable of infecting one or 
more types of plant cells, regardless of whether it 
can" infect other types of cells. 

With the possible exception of viroids 
5 (which are poorly understood at present), every viral 
particle must contain at least one gene which can be 
"expressed" in infected host cells. The expression of 
a gene requires that a segment of DNA or RNA must be 
transcribed into or function as a strand of messenger 

10 RNA (mRNA) , and the mRNA must be translated into a 
polypeptide. Most viruses have about 5 to 10 
different genes, all of which are expressed in a 
suitable host cell. 

Promoters from viral genes have been 

15 utilized in a variety of genetic engineering 

applications. For example, chimeric genes have been 
constructed using various structural sequences (also 
called coding sequences) taken from bacterial genes, 
coupled to promoters taken from viruses which can 

20 infect mammalian cells (the most commonly used 

mammalian viruses are designated as Simian Virus 40 
(SV40) and Herpes Simplex Virus (HSV)). These 
chimeric genes have been used to transform mammalian 
cells. See, e.g., Mulligan et al 1979; Southern and 

25 Berg 1982. In addition, chimeric genes using 
promoters taken from viruses which can infect 
bacterial cells have been used to transform bacterial 
cells; see, e.g., the phage lambda PL promoter 
discussed in Maniatis et al, 1982. 

30 Several researchers have theorized that it 

might be possible to utilize plant viruses as vectors 
for transforming plant cells. See, e.g., Hohn et al, 
1982. In general, a "vector" is a DNA molecule useful 
for transferring one or more genes into a cell. 

35 Usually, a desired gene is inserted into a vector, and 
the vector is then used to infect the host cell. 
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Several researchers have theorized that it 
might be possible to create chimeric genes which are 
capable of being expressed in plant cells, by using 
promoters derived from plant virus genes. See, e.g., 
5 Hohn et al, 1982, at page 216. 

However, despite the efforts of numerous 
research teams, prior to this invention no one had 
succeeded in (1) creating a chimeric gene comprising a 
plant virus promoter coupled to a heterologous 
10 structural sequence and (2) demonstrating the 

expression of such a gene in any type of plant cell. 

Cauliflower Mosaic Virus (CaMV) 

The entire DNA sequence of CaMV has been 
published. Gardner et al, 1981; Hohn et al, 1982. In 

15 its most common form, the CaMV genome is about 8000 bp 
long. However, various naturally occurring infective 
mutants which have deleted about 500 bp have been 
discovered; see Howarth et al 1981. The entire CaMV 
genome is transcribed into a single mRNA, with a 

20 sedimentation coefficient of 35S. The promoter for 

the 32 S mRNA is located in the large intergenic region 
about 1 kb counterclockwise from Gap 1 (see Guilley et 
al, 1982). 

CaMV is believed to generate at least eight 
25 proteins; the corresponding genes are designated as 

Genes I through VIII. Gene VI is transcribed into mRNA 
with a sedimentation coefficient of 19S. The 19S mRNA 
is transcribed into a protein designated as P66, which 
is an inclusion body protein. The 19S mRNA is 
30 promoted by the 19S promoter, located about 2.5 kb 
counterclockwise of Gap 1. 
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SUMMARY OF THE INVENTION 



This invention relates to chimeric genes 
which are capable of being expressed in plant cells, 
and to a method for creating such genes. 



5 



The chimeric gene comprises a promoter 



region which is capable of causing RNA polymerase in a 
plant cell to create messenger RNA corresponding to 
the DNA. One such promoter region comprises a 

10 nopaline synthase (NOS) promoter region, which 

normally exists in certain types of Ti plasmids in 
bacteria, A. tumefaciens . The NOS gene normally is 
inactive while contained in A. tumefaciens cells, and 
it becomes active after the Ti plasmid enters a plant 

15 cell. Two other promoters have been derived from the 
cauliflower mosaic virus (CaMV). Other suitable 
promoter regions may be derived from genes which exist 
naturally in plant cells, or from viruses which are 
capable of infecting plant cells. 



The chimeric gene also contains a sequence 
of bases which codes for a 5' non-translated region of 
mRNA which is capable of enabling or increasing the 
expression in a plant cell of a structural sequence of 
25 the mRNA. For example, a suitable 5' non- translated 
region may be taken from the NOS gene, from a plant 
virus gene, or from a gene which exists naturally in 
plant cells. 

30 The chimeric gene also contains a desired 

structural sequence, i.e., a sequence which is 
transcribed into mRNA which is capable of being 
translated into a desired polypeptide. The structural 
sequence is heterologous with respect to the promoter 

35 region, and it may code for any desired polypeptide, 



20 
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such as a bacterial or mammalian protein. The 
structural sequence includes a start codon and a stop 
codon. The structural sequence may contain introns 
which are removed from the mRNA prior to translation. 

5 If desired, the chimeric gene may also 

contain a DNA sequence which codes for a 3' 
non- translated region (including a poly-adenylation 
signal) of mRNA. This region may be derived from a 
gene which is naturally expressed in plant cells, to 
10 help ensure proper expression of the structural 
sequence. Such genes include the NOS gene, plant 
virus genes, and genes which exist naturally in plant 
cells. 

15 The method of this invention is described 

below, and is summarized in the flow chart of Figure 
2. 

If properly assembled and inserted into a 
plant genome, a chimeric gene of this invention will 
be expressed in the plant cell to create a desired 
polypeptide, such as a mammalian hormone, or a 
bacterial enzyme which confers antibiotic or herbicide 
resistance upon the plant. 

Brief Description of the Drawings 

The figures herein are schematic 
representations; they have not been drawn to scale. 

Figure 1 represents the structure of a 
typical eukaryotic gene. 

Figure 2 is a flow chart representing the 
35 steps of this invention, correlated with an example 
chimeric NOS-NPTII-NOS gene. 
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Figure 3 represents fragment Hindi I I -23, 



obtained by digesting a Ti plasmid with Hindi II. 

Figure 4 represents a DNA fragment which 
5 contains a NOS promoter region, a NOS 5 1 

non-translated region, and the first few codons of the 
NOS structural sequence. 

Figure 5 represents the cleavage of a DNA 
10 sequence at a precise location, to obtain a DNA 

fragment which contains a NOS promoter region and 
complete 5' non-translated region. 

Figure 6 represents the creation of plasmids 
15 pMONlOOl and pMON40, which contain an NPTII structural 
sequence . 

Figure 7 represents the insertion of a NOS 
promoter region into plasmid pMON40, to obtain pMON58. 

20 

Figure 8 represents the creation of an. M13 
derivative designated as M-2, which contains a NOS 3 1 
non- translated region and poly-A signal. 

25 Figure 9 represents the assembly of the 

NOS-NPTII-NOS chimeric gene, and the insertion of the 
chimeric gene into plasmid pMON38 to obtain plasmids 
pMON75 and pMON76. 

30 Figure 10 represents the insertion of the 

NOS-NPTII-NOS chimeric gene into plasmid pMON120 to 
obtain plasmids pMON128 and pMON129. 



35 



Figure 11 represents the creation of plasmid 
pMON66, which contains an NPTI gene. 




WO 84/02913 



PCT/US84/00048 



-21- 



10 



15 



20 



25 



30 



Figure 12 represents the creation of plasmid 
pMON73, containing a chimeric NOS-NPTII sequence. 

Figure 13 represents the creation of plasmid 
pMON78, containing a chimeric NOS-NPTI sequence. 

Figure 14 represents the creation of 
plasmids pMON106 and pMON107, which contain chimeric 
NOS-NPTI-NOS genes. 

Figure 15 represents the insertion of a 
chimeric NOS-NPTI-NOS gene into pMON120 to obtain 
plasmids pMON130 and pMON131. 

Figure 16 represents the structure of a DNA 
fragment containing a soybean protein (sbss) promoter. 

Figure 17 represents the creation of plasmid 
pM0N121, containing the sbss promoter. 

Figure 18 represents the insertion of a 
chimeric sbss-NPTII-NOS gene into pMON120 to create 
plasmids pM0N141 and pMON142 . 

Figure 19 represents the creation of plasmid 
PMON108, containing a bovine growth hormone structural 
sequence and a NOS 3' region. 

Figure 20 represents the creation of plasmid 
N25-BGH, which contains the BGH-NOS sequence 
surrounded by selected cleavage sites. 

Figure 21 represents the insertion of a 
chimeric sbss-BGH-NOS gene into pMON120 to obtain 
plasmids pM0N147 and pMON148. 
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Figure 22 represents the creation of plasmid 
pMON149, which contains a chimeric NOS-BGH-NOS gene. 

5 Figure 23 represents the creation of plasmid 

pMON8, which contains a structural sequence for EPSP 
synthase . 

Figure 24 represents the creation of plasmid 
10 pMON25, which contains an EPSP synthase structural 
sequence with several cleavage sites near the start 
codon. 

Figure 25 represents the creation of plasmid 
15 pM0N146, which contains a chimeric sequence comprising 
EPSP synthase and a NOS 3* region. 

Figure 26 represents the insertion of a 
chimeric NOS-EPSP-NOS gene into pMON120 to obtain 
20 plasmid pMON153. 

Figure 27 represents the creation of plasmid 
PMON154, which contains a chimeric sbss-EPSP-NOS gene. 

Figure 28 represents the creation and 
25 structure of plasmid pMON93, which contains a CaMV 19S 
promoter. 

Figure 29 represents the creation and 
structure of plasmid pMON156, which contains a 
chimeric CaMV(19S)-NPT-NOS gene. 

30 Figure 30 represents the creation and 

structure of plasmid pMONHO, which contains a partial 
NPT gene. 
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Figure 31 represents the creation and 
structure of plasmid pMON132, which contains a partial 
NPT--NOS gene. 

Figure 32 represents the creation and 
structure of plasmid pMON155, which contains a 
chimeric CaMV(19S)-NPT-NOS gene. 

Figure 33 represents the creation and 
structure of plasmid pDMON81, which contains a CaMV 
32S promoter. 

Figure 34 represents the creation and 
structure of plasmid pM0N125, which contains a CaMV 
32S promoter. 

Figure 35 represents the creation and 
structure of plasmid pMON172, which contains a CaMV 
32S promoter. 

Figure 36 represents the creation and 
structure of phage M12, which contains a CaMV 32S 
promoter . 

Figure 37 represents the creation and 
structure of plasmids pMON183 and pMON184, which 
contain chimeric CaMV(32S)-NPT-NOS genes. 

DETAILED DESCRIPTION OF THE INVENTION 

In one preferred embodiment of this 
invention, a chimeric gene was created which contained 
the following elements: 



region derived from a nopaline synthase (NOS) gene; 



1 . a promoter region and a 5 ' non- translated 
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2. a structural sequence derived from a 
neomycin phosphotransferase II (NPTII or NPT II) gene; 
and, 

5 3. a 3 f non- translated region, including a 

poly-adenylation signal, derived from a NOS gene. 

This chimeric gene, referred to herein as a 
NOS-NPTII-NOS gene, was assembled and inserted into a 
10 variety of plant cells, causing them to become 
resistant to aminoglycoside antibiotics such as 
kanamycin. 

The method used to assemble this chimeric 
15 gene is summarized in the flow chart of Figure 2, and 
described in detail below and in the examples. To 
assist the reader in understanding the steps of this 
method, various plasmids and fragments involved in the 
NOS-NPTII-NOS chimeric gene are- cited in parentheses 
20 in Figure 2. However, the method of Figure 2 is 

applicable to a wide variety of other plasmids and 
fragments. To further assist the reader, the steps 
shown in Figure 2 have been assigned callout numbers 
42 et seq. These callout numbers are cited in the 
25 following description. The techniques and DNA 

sequences of this invention are likely to be useful in 
the transformation of a wide variety of plants, 
including any plant which may be infected by one or 
more strains of A. tumefaciens or A. rhizoqenes . 

30 

The NOS Promoter Region and 5' Non- translated Region 

The Applicants decided to obtain and utilize 
35 a nopaline synthase (NOS) promoter region to control 
the expression of the heterologous gene. The NOS is 
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normally carried in certain types of Ti plasmids, such 
as pTiT37. Sciaky et al, 1978. The NOS promoter is 
normally inactive while in an A. tumefaciens cell. 
The entire NOS gene, including the promoter and the 
5 protein coding sequence, is within the T-DNA portion 
of a Ti plasmid that is inserted into the chromosomes 
of plant cells when a plant becomes infected and forms 
a crown gall tumor. Once inside the plant cell, the 
NOS promoter region directs RNA polymerase within a 
10 plant cell to transcribe the NOS protein coding 

sequence into mRNA, which is subsequently translated 
into the NOS enzyme. 

The boundaries between the different parts 

15 of a promoter region (shown in Figure 1 as association 
region 2, intervening region 4, transcription 
initiation sequence 6, and intervening region 8), and 
the boundary between the promoter region and the 5* 
non- translated region, are not fully understood. The 

20 Applicants decided to utilize the entire promoter 

region and 5' non- translated region from the NOS gene, 
which is known to be expressed in plant cells. 
However, it is entirely possible that one or more of 
these sequences might be modified in various ways, 

25 such as alteration in length or replacement by other 

sequences. Such modifications in promoter regions and 
5' non- translated regions have been studied in 
bacterial cells (see, e.g., Roberts et al 1979) and 
mammalian cells (see, e.g., McKnight, 1982). By 

30 utilizing the methodology taught by this invention, it 
is now possible to study the effects of modifications 
to promoter regions and 5 1 non-translated regions on 
the expression of genes in plant cells. It may be 
possible to increase the expression of a gene in a 

35 plant cell by means of such modifications. Such 

modifications, if performed upon chimeric genes of 
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this invention, are within the scope of this 
invention. 

A nopaline-type tumor- inducing plasmid, 
5 designated as pTiT37, was isolated from a strain of 
A. tumefaciens using standard procedures (Currier and 
Nester, 1976). It was digested with the endonuclease 
Hindi I I which produced numerous fragments. These 
fragments were separated by size on a gel, and one of 

10 the fragments was isolated and removed from the gel. 
This fragment was designated as the Hindi I I -23 
fragment, because it was approximately the 23rd 
largest fragment from the Ti plasmid; it is 
approximately 3400 base pairs (bp) in size, also 

15 referred to as 3.4 kilobases (kb). From work by 

others (see, e.g., Hernalsteens et al, 1980), it was 
known that the Hindi I 1-23 fragment contained the 
entire NOS gene, including the promoter region, a 5' 
non-translated region, a structural sequence with a 

20 start codon and a stop codon, and a 3 1 non- trans la ted 
region. The Hindi I 1-23 fragment is shown in Figure 3. 

By means of various cleavage and sequencing 
experiments, it was determined that the Hindi I I -23 

25 fragment could be digested by another endonuclease, 
Sau3a, to yield a fragment, about 350 bp in size, 
which contains the entire NOS promoter region, the 5' 
non-translated region, and the first few codons of the 
NOS structural sequence. This fragment was sequenced, 

30 and the base sequence is represented in Figure 4. The 
start codon (ATG) of the NOS structural sequence 
begins at base pair 301 within the 350 bp fragment. 
The Applicants decided to cleave the fragment between 
base pairs 300 and 301; this would provide them with a 

35 fragment about 300 base pairs long containing a NOS 
promoter region and the entire 5' non- translated 
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region but with no translated bases. To cleave the 
350 bp fragment at precisely the right location, the 
Applicants obtained an M13 clone designated as S1A, 
and utilized the procedure described below. 

5 

To create the SlA clone, Dr. Michael Bevan 
of Washington University converted the 350 bp Sau3a 
fragment into a single strand of DNA. This was done 
by utilizing a virus vector, designated as the M13 mp2 

10 phage, which goes through both double-stranded (ds) 
and single-stranded (ss) stages in its' life cycle 
(Messing et al, 1981). The ds 350 bp fragment was 
inserted into the double-stranded replicative form DNA 
of the M13 mp2, which had been cleaved with BamHI. 

15 The two fragments were ligated, and used to infect 
E. coli cells. The ds DNA containing the 350 bp 
inserted fragment subsequently replicated, and one 
strand (the viral strand) was encapsulated by the M13 
viral capsid proteins. In one clone, designated the 

20 SlA, the orientation of the 350 bp fragment was such 
that the anti-sense strand (containing the same 
sequence as the mRNA) of the NOS gene was carried in 
the viral strand. Viral particles released from 
infected cells were isolated, and provided to the 

25 Applicants. 

Single stranded SlA DNA, containing the 
anti-sense 350 bp fragment with the NOS promoter 
region, was isolated from the viral particles and 

30 sequenced. A 14-mer oligonucleotide primer was 

synthesized, using published procedures (Beaucage and 
Carruthers, 1981, as modified by Adams et al, 1982). 
This 14-mer was designed to be complementary to bases 
287 through 300 of the 350 bp fragment, as shown on 

35 Figure 4. 
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The 5 1 end of the synthetic primer was 
radioactively labelled with 32 P; this is represented 
in the figures by an asterisk. 

5 Copies of the primer were mixed with copies 

of the single-stranded S1A DNA containing the 
anti-sense strand of the 350 bp fragment. The primer 
annealed to the desired region of the S1A DNA, as 
shown at the top of Figure 5. After this occurred, 

10 Klenow DNA polymerase and a controlled quantity of 

unlabelled deoxy-nucleoside triphosphates (dNTP's), A, 
T, C, and G, were added. Klenow polymerase added 
nucleotides to the 3 1 (unlabelled) end of the primer, 
but not to the 5' (labelled) end. The result, as 

15 shown in Figure 5, was a circular loop of 

single-stranded DNA, part of which was matched by a 
second strand of DNA. The 5' end of the second strand 
was located opposite base #300 of the Sau3a insert. 

20 The partially double-stranded DNA was then 

digested by a third endonuclease, Haelll, which can 
cleave both single-stranded and double- stranded DNA. 
Haelll cleavage sites were known to exist in several 
locations outside the 350 bp insert, but none existed 

25 inside the 350 bp insert. This created a fragment 
having one blunt end, and one 3 1 overhang which 
started at base #301 of the Sau3a insert. 

The Haelll fragment mixture was treated with 
30 T4 DNA polymerase and unlabelled dNTP's. This caused 

the single stranded portion of the DNA, which extended 
from base #301 of the Sau3a insert to the closest 
Haelll cleavage site, to be removed from the fragment. 
In this manner, the ATG start codon was removed from 
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base pair #300, leaving a blunt end double-stranded 
fragment which was approximately 550 bp long. 

The mixture was then digested by a fourth 
5 endonuclease EcoRI, which cleaved the 550 bp fragment 
at a single site outside the NOS promoter region. The 
fragments were then separated by size on a gel, and 
the radioactively-labelled fragment was isolated. 
This fragment contained the entire NOS promoter region 
10 and 5 f non- translated region. It had one blunt end 
with a seguence of 
5*-. . . CTGCA 
. . . GACGT 

and one cohesive end (at the EcoRI site) with a 
15 seguence of 

5 » AATTC- 
G- 

20 The shorter strand was about 308 bp long. 

The foregoing steps are represented in 
Figure 2 as steps 42, 44, and 46. 

25 This fragment was inserted into pMON40 

(which is described below) to obtain pMON58, as shown 
on Figure 7. 

Creation of plasmid with NET II gene (pMON40) 

30 

A bacterial transposon, designated as Tn5, 
is known to contain a complete NPT II gene, including 
the promoter region, structural seguence, and 3 ! 
non-translated region. The NPT II enzyme inactivates 
35 certain aminoglycoside antibiotics, such as kanamycin, 
neomycin, and G418; see Jimenez and Davies, 1980. 
This gene is contained within a 1.8 kb fragment, which 
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can be obtained by digesting phage lambda bbkan-1 DNA 
(D. Berg et al, 1975) with two endonucleases , Hindi II 
and BamHI . This fragment was inserted into a common 
laboratory plasmid, pBR327, which had been digested by 
5 Hindu I and BamHI. As shown in Figure 6, the 

resulting plasmid was designated as pMONlOOl, which 
was about 4.7 kb. 

To reduce the size of the DNA fragment which 
10 carried the NPT II structural sequence, the Applicants 
eliminated about 500 bp from the pMONlOOl plasmid, in 
the following manner. First, they digested pMONlOOl 
at a unique Smal restriction site which was outside of 
the NPT II gene. Next, they inserted a 10-mer 
15 synthetic oligonucleotide linker, 
5" CCGGATCCGG 
GGCCTAGGCC 

into the Smal cleavage site. This eliminated the Smal 
cleavage site and replaced it with a BamHI cleavage site. 

20 A second BamHI cleavage site already existed,- about 

500 bp from the new BamHI site. The Applicants digested 
the plasmid with BamHI, separated the 500 bp fragment 
from the 4.2 kb fragment, and circularized the 4.2 kb 
fragment. The resulting plasmids were inserted into 

25 S. coli , hich were then selected for resistance to 
ampicillin and kanamycin. A clonal colony of 
E. coli was selected; these cells contained a plasmid 
which was designated as pMON40, as shown in Figure 6. 

30 The foregoing steps are represented in 

Figure 2 as steps 48 and 50. 

Insertion of NQS promoter into plasmid pMON40 

35 The Applicants deleted the NPT II promoter 

from pMON40 , and replaced it with the NOS promoter 
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fragment described previously, by the following 
method, shown on Figure 7. 

Previous cleavage and sequencing experiments 
(Rao and Rogers, 1979; Auerswald et al, 1980) 
5 indicated that a Bglll cleavage site existed in the 
NPT II gene between the promoter region and the 
structural sequence. Plasmid pMON40 was digested 
with Bglll. The cohesive ends were then filled in by 
mixing the cleaved plasmid with Klenow polymerase and 
10 the four dNTP's, to obtain the following blunt ends: 
5' - AGATC GATCT- 

- TCTAG CTAGA-5 1 

The polymerase . and dNTP's were removed, and the 

15 cleaved plasmid was then digested with EcoRI . The 

smaller fragment which contained the NPT II promoter 
region was removed, leaving a large fragment with one 
EcoRI end and one blunt end. This large fragment was 
mixed with the 308 bp fragment which contained the NOS 

20 promoter, described previously and shown on Figure 5. 

The fragments were ligated, and inserted into E. coli . 
E. coli clones were selected for ampicillin 
resistance. Replacement of the NPT II promoter region 
(a bacterial promoter) with the NOS promoter region 

25 (which is believed to be active only in plant cells) 
caused the NPT II structural sequence to become 
inactive in E. coli . Plasmids from 36 
kanamycin- sensitive clones were obtained; the plasmid 
from one clone, designated as pMON58, was utilized in 

30 subsequent work. 

The foregoing steps are represented in 
Figure 2 as steps 52 and 54. 

35 Plasmid pMON58 may be digested to obtain a 

1.3 kb EcoRI -BamHI fragment which contains the NOS 
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promoter region, the NOS 5' non-translated region, and 
the NPT II structural sequence. This step is 
represented in Figure 2 as step 56. 

Insertion of NOS 3' sequence into NPT II gene 

5 

As mentioned above in "Background Art", the 
functions of 3' non-translated regions in eucaryotic 
genes are not fully understood. However, they are 
believed to contain at least one important sequence, a 
10 poly-adenylation signal. 

It was suspected by the Applicants that a 
gene having a bacterial 3» non-translated region might 
not be expressed as effectively in a plant cell as the 

15 same gene having a 3' non- translated region from a 

gene, such as NOS, which is known to be expressed in 
plants. Therefore, the Applicants decided to add a 
NOS 3' non- translated region to the chimeric gene, in 
addition to the NPT II 3' non- translated region 

20 already present. Alternately, it is possible, using 
the methods described herein, to delete the NPT II or 
other existing 3 ' non- translated region and replace 
it with a desired 3' non- translated region that is 
known to be expressed in plant cells. Whether a 

25 different type of 3' non- translated region (such as a 
3 ' region from an octopine-type or agropine-type Ti 
plasmid, or a 3' region from a gene that normally 
exists in a plant cell) would be suitable or preferable 
for use in any particular type of chimeric gene, for 

30 use in any specific type of plant cell, may be 

determined by those skilled in the art through routine 
experimentation using the method of this invention. 

Those skilled in the art may also determine 
35 through routine experimentation whether the 3 ' 
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non-translated region that naturally follows a 
structural sequence that is to be inserted into a 
plant cell will enhance the efficient expression of 
that structural sequence in that type of plant cell. 
5 If so, then the steps required to insert a different 
3' non-translated region into the chimeric gene might 
not be required in order to perform the method of this 
invention. 

10 In order to obtain a DNA fragment containing 

a NOS 3' non- trans la ted region appropriate for joining 
to the NPT II structural sequence from pMON58 
(described previously), the Applicants utilized a 3.4 
kb Hindi I 1-23 fragment from a Ti plasmid, shown on 

15 Figure 3. This 3.4 kb fragment was isolated and 

digested with BamHI to obtain a 1.1 kb BamHI-Hindlll 
fragment containing a 3' portion of the NOS structural 
sequence (including the stop codon), and the 3 1 
non-translated region of the NOS gene (including the 

20 poly-adenylation signal). This 1.1 kb fragment was 

inserted into a pBR327 plasmid which had been digested 
with Hindi I I and BamHI. The resulting plasmid was 
designated as pMON42, as shown on Figure 8. 

25 Plasmid pMON42 was digested with BamHI and 

Rsal, and a 720 bp fragment containing the desired NOS 
3' non-translated region was purified on a gel. The 
720 bp fragment was digested with another 
endonuclease, Mbol, and treated with the large 

30 fragment of E. coli DNA polymerase I . This resulted 
in a 260 bp fragment with Mbol blunt ends, containing 
a large part of the NOS 3' non- trans la ted region 
including the poly-A signal. 

35 The foregoing procedure is represented in 

Figure 2 by step 58. However, it is recognized that 
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alternate means could have been utilized; for example, 
it might have been possible to digest the Hindi I I -23 
fragment directly with Mbol to obtain the desired 260 
bp fragment with the NOS 3' non- translated region. 

5 Assembly of Chimeric Gene 

To complete the assembly of the chimeric 
gene, it was necessary to ligate the 260 bp Mbol 
fragment (which contained the NOS 3' non-translated 

10 region) to the 1.3 kb EcoRI-BamHI fragment from pMON58 
(which contained the NOS promoter region and 5' 
non- translated region and the NPT II structural 
sequence). In order to facilitate this ligation and 
control the orientation of the fragments, the 

15 Applicants decided to convert, the Mbol ends of the 260 
bp fragment into a BamHI end (at the 5" end of the 
fragment) and an EcoRI end (at the 3' end of the 
fragment). In order to perform this step, the 
Applicants used the following method. 

20 

The 260 bp Mbol fragment, the termini of 
which had been converted to blunt ends by Klenow 
polymerase, was inserted into M13 mp8 DNA at a Smal 
cleavage site. The Smal site is surrounded by a 

25 variety of other cleavage sites present in the M13 mp8 
DNA, as shown in Figure 8. The Mbol fragment could be 
inserted into the blunt Smal ends in either 
orientation. The orientation of the Mbol fragments in 
different clones were tested, using Hinfl cleavage 

30 sites located assymetrically within the Mbol fragment. 
A clone was selected in which the 3 • end of the NOS 3 » 
non- translated region was located near the EcoRI 
cleavage site in the M13 mp8 DNA. This clone was 
designated as the M-2 clone, as shown in Figure 8. 

35 
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Replicative form (double stranded) DNA from 
the M-2 clone was digested by EcoRI and BamHI and a 
280 bp fragment was isolated. Separately, plasmid 
pMON58 was digested by EcoRI and BamHI, and a 1300 bp 
5 fragment was isolated. The two fragments were 
ligated, as shown in Figure 9, to complete the 
assembly of a NOS-NFTII-NOS chimeric gene having EcoRI 
ends. 

10 There are a variety of ways to control the 

ligation of the two fragments. For example, the two 
EcoRI -BamHI fragments could be joined together with 
DNA ligase and cleaved with EcoRI. After inactivation 
of EcoRI, a vector molecule having EcoRI ends that 

15 were treated with calf alkaline phosphatase (CAP) may 
be added to the mixture. The fragments in the mixture 
may be ligated in a variety of orientations. The 
plasmid mixture is used to transform E. coli , and 
cells having plasmids with the desired orientation are 

20 selected or screened, as described below. 

A plasmid, designated as pMON38, was 
created by insertion of the HindIII-23 fragment (from 
Ti plasmid pTiT37) into the Hindi I I cleavage site of 

25 the plasmid pBR327. Plasmid pMON38 contains a unique 
EcoRI site, and an ampicillin-resistance gene which is 
expressed in E. coli . Plasmid pMON38 was cleaved with 
EcoRI and treated with alkaline phosphatase to prevent 
it from re-ligating to itself. U.S. Patent 4,264,731 

30 (Shine, 1981). The resulting fragment was mixed with 
the 1300 bp NOS-NPTII fragment from pMON58, and the 
280 bp NOS fragment from M-2, which had been ligated 
and EcoRI -cleaved as described in the previous 
paragraph. The fragments were ligated, and inserted 

35 into E. coli . The E. coli cells which had acquired 

intact plasmids with ampicillin-resistance genes were 
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selected on plates containing ampicillin. Several 
clones were selected, and the orientation of the 
inserted chimeric genes was evaluated by means of 
cleavage experiments. Two clones having plasmids 
carrying NOS-NPT II-NOS inserts with opposite 
orientations were selected and designated as pMON75 
and pMON76, as shown in Figure 9. The chimeric gene 
may be isolated by digesting either pMON75 or pMON76 
with EcoRI and purifying a 1580 bp fragment. 

The foregoing procedure is represented on 
Figure 2 by step 60. 

This completes the discussion of the 
NOS-NPTII-NOS chimeric gene. Additional information 
on the creation of this gene is provided in the 
Examples. A copy of this chimeric gene is contained 
in plasmid pMON128; it may be removed from pMON128 by 
digestion with EcoRI. A culture of E. coli containing 
PMON128 has been deposited with the American Type 
Culture Collection; this culture has been assigned 
accession number 39264. 

To prove the utility of this chimeric gene, 
the Applicants inserted it into plant cells. The 
NPTII structural sequence was expressed in the plant 
cells, causing them and their descendants to acquire 
resistance to concentrations of kanamycin which are 
normally toxic to plant cells. 

Creation of NPT I Chimeric Gene 



35 



In an alternate preferred embodiment of this 
invention, a chimeric gene was created comprising (1) 
a NOS promoter region and 5' non- translated region, 
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(2) a structural sequence which codes for NPT I, and 

(3) a NOS 3' non-translated region. 

NPT I and NPT II are different and distinct 
5 enzymes with major differences in their amino acid 

sequences and substrate specificities. See, e.g., E. 
Beck et al, 1982. The relative stabilities and 
activities of these two enzymes in various types of 
plant cells are not yet fully understood, and NPT I 
10 may be preferable to NPT II for use in certain types 
of experiments and plant transformations. 

A 1200 bp fragment containing an entire NPT 
I gene was obtained by digesting pACYC177 (Chang and 

15 Cohen, 1978) with the endonuclease, Avail. The Avail 
termini were converted to blunt ends with Klenow 
polymerase, and converted to BamHI termini using a 
synthetic linker. This fragment was inserted into a 
unique BamHI site in a pBR327-derived plasmid, as 

20 shown in Figure 11. The resulting plasmid was 
designated as pMON66. 

Plasmid pMON57 (a deletion derivative of 
pBR327, as shown in Figure 11) was digested with 
25 Avail. The 225 bp fragment of pMON57 was replaced by 
the analogous 225 bp Avail fragment taken from plasmid 
pUC8 (Vieira and Messing, 1982), to obtain a 
derivative of pMON57 with no PstI cleavage sites. 
This plasmid was designated as pM0N67. 

30 

Plasmid pMON58 (described previously and 
shown in Figure 7) was digested with EcoRI and BamHI 
to obtain a 1300 bp fragment carrying the NOS promoter 
and the NPT II structural sequence. This fragment was 
35 inserted into pMON67 which had been digested with 
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EcoRI and BamHI. The resulting plasmid was designated 
as pMON73, as shown in Figure 12. 



5 a 2.4 kb fragment was isolated containing a NOS 

promoter region and 5» non-translated region. Plasmid 
pMON66 (shown on Figure 11) was digested with Xhol and 
BamHI to yield a 950 bp fragment containing the 
structural sequence of NPT I. This fragment lacked 

10 about 30 nucleotides at the 5 ' end of the structural 
sequence. A synthetic linker containing the missing 
bases, having appropriate PstI and Xhol ends, was 
created. The pMON73 fragment, the pMON66 fragment, 
and the synthetic linker were ligated together to 

15 obtain plasmid pMON78„ as shown in Figure 13. This 
plasmid contains the NOS promoter region and 5' 
non- translated region joined to the NPT I structural 
sequence. The ATG start codon was in the same 
position that the ATG start codon of the NOS 

20 structural sequence had occupied. 

Plasmid pMON78 was digested with EcoRI 
and BamHI to yield a 1300 bp fragment carrying the 
chimeric NOS-NPT I regions. Double-stranded DNA from 

25 the M-2 clone (described previously and shown on 

Figure 9) was digested with EcoRI and BamHI, to yield 
a 280 bp fragment carrying a NOS 3 1 non- trans la ted 
region with a poly-adenylation signal. The two 
fragments described above were ligated together to 

30 create the NOS-NPT I-NOS chimeric gene, which was 

inserted into plasmid pMON38 (described above) which 
had been digested with EcoRI. The two resulting 
plasmids, having chimeric gene inserts with opposite 
orientations, were designated as pMON106 and pMON107, 

35 as shown in Figure 14. 



pMON73 was digested with PstI and BamHI, and 
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Either of plasmids pMON106 or pMON107 may be 
digested with EcoRI to yield a 1.6 kb fragment 
containing the chimeric NOS-NPT I-NOS gene. This 
fragment was inserted into plasmid pMON120 which had 
5 been digested with EcoRI and treated with alkaline 
phosphatase. The resulting plasmids, having inserts 
with opposite orientations, were designated as pMON130 
and pMON131, as shown on Figure 15. 

10 The NOS-NPT I-NOS chimeric gene was inserted 

into plant cells, which acquired resistance to 
kanamycin. This demonstrates expression of the 
chimeric gene in plant cells. 

15 Creation of Chimeric Gene with Soybean Promoter 

In an alternate preferred embodiment of this 
invention, a chimeric gene was created comprising (1) 
a promoter region and 5' non- translated region taken 

20 from a gene which naturally exists in soybean; this 
gene codes for the small subunit of 
ribulose-1 , 5 - bis -phosphate carboxylase (sbss, for 
soybean small subunit); (2) a structural sequence 
which codes for NPT II, and (3) a NOS 3' 

25 non- translated region. 

The sbss gene codes for a protein in soybean 
leaves which is involved in photosynthetic carbon 
fixation. The sbss protein is the most abundant 
30 protein in soybean leaves (accounting for about 10% of 
the total leaf protein), so it is likely that the sbss 
promoter region causes prolific transcription. 

There are believed to be approximately six 
35 genes encoding the sbss protein in the soybean 
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genome. One of the members of the sbss gene 
family, SRS1, which is highly transcribed in soybean 
leaves, has been cloned and characterized. The 
promoter region, 5' nontranslated region, and a 
5 portion of the structural sequence are contained on a 
2.1 kb EcoRI fragment that was subcloned into the 
EcoRI site of plasmid pBR325 (Bolivar, 1978). The 
resultant plasmid, pSRS2.1, was a gift to Monsanto 
Company from Dr. R. B. Meagher, University of Georgia, 
10 Athens, GA. The 2.1 kb EcoRI fragment from pSRS2.1 is 
shown on Figure 16. 



E. coli cells, and cleaved with Mbol to obtain an 800 
15 bp fragment. This fragment was inserted into plasmid 
pKC7 (Rao and Rogers, 1979) which had been cleaved 
with Bglll. The resulting plasmid was designated as 
PMON121, as shown on Figure 17. 

20 Plasmid pMON121 was digested with EcoRI and 

Bell, and a 1200 bp fragment containing the sbss 
promoter region was isolated. Separately, plasmid 
pMON75 (described previously and shown on Figure 9) 
was digested with EcoRI and Bglll, and a 1250 bp 

25 fragment was isolated, containing a NPT II structural 
sequence and a NOS 3 1 non-translated region. The two 
fragments were ligated at the compatible Bcll/Bglll 
overhangs, to create a 2450 bp fragment containing 
sbss-NFT II-NOS chimeric gene. This fragment was 

30 inserted into pMON120 which had been cleaved with 
EcoRI, to create two plasmids having chimeric gene 
inserts with opposite orientations, as shown in Figure 
18. The plasmids were designated as pM0N141 and 
PMON142 . 



Plasmid pSRS2.1 was prepared from dam- 



35 




WO 84/02913 



PCT/US84/00048 



-41- 



The sbss-NPTII-NOS chimeric genes were 
inserted into several types of plant cells, causing 
the- plant cells to acquire resistance to kanamycin. 

This successful transformation proved that a 
promoter region from one type of plant can cause the 
expression of a gene within plant cells from an 
entirely different genus, family, and order of plants. 

The chimeric sbss-NPT II-NOS gene also had 
another significant feature. Sequencing experiments 
indicated that the 800 bp Mbol fragment contained the 
ATG start codon of the sbss structural sequence. 
Rather than remove this start codon, the Applicants 
decided to insert a stop codon behind it in the same 
reading frame. This created a dicistronic mRNA 
sequence, which coded for a truncated amino portion of 
the sbss polypeptide and a complete NPT II 
polypeptide. Expression of the NPT II polypeptide was 
the first proof that a dicistronic mRNA can be 
translated within plant cells. 

The sbss promoter is contained in plasmid 
pMON154, described below. A culture of E. coli 
containing this plasmid has been deposited with the 
American Type Culture Collection. This culture has been 
assigned accession number 39265. 

Creation of BGH Chimeric Genes 



In an alternate preferred embodiment of this 
invention, a chimeric gene was created comprising (1) 
a sbss promoter region and 5' non-translated region, 
(2) a structural sequence which codes for bovine 
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growth hormone (BGH) and (3) a NOS 3' non- trans la ted 
region. This chimeric gene was created as follows. 

A structural sequence which codes for the 
5 polypeptide, bovine growth hormone, (see, e.g., 
Woychik et al, 1982) was inserted into a 
pBR322-derived plasmid. The resulting plasmid was 
designated as plasmid CP-1. This plasmid was digested 
with EcoRI and Hindlll to yield a 570 bp fragment 

10 containing the structural sequence. Double stranded 
M-2 RF DNA (described previously and shown in Figure 
8) was cleaved with EcoRI and Hindi I I to yield a 290 
bp fragment which contained the NOS 3 ' non- translated 
region with a poly-adenylation signal. The two 

15 fragments were ligated together and digested with 

EcoRI to create an 860 base pair fragment with EcoRI 
ends, which contained a BGH-coding structural sequence 
joined to the NOS 3' non- translated region. This 
fragment was introduced into plasmid pMON38, which had 

20 been digested with EcoRI and treated with alkaline 
phosphatase, to create a new plasmid, designated as 
pMON 108, as shown in Figure 19. 

A unique Bglll restruction site was 
25 introduced at the 5' end of the BGH structural 

sequence by digesting pMON 108 with EcoRI to obtain 
the 860 bp fragment, and using Klenow polymerase to 
create blunt ends on the resulting EcoRI fragment. 
This fragment was ligated into plasmid N25 (a 
30 derivative of pBR327 containing a synthetic linker 

carrying Bglll and Xbal cleavage sites inserted at the 
BamHI site), which had been cleaved with Xbal and 
treated with Klenow polymerase to obtain blunt ends 
(N25 contains a unique Bglll site located 12 bases 
35 from the Xbal site). The resulting plasmid, which 
contained the 860bp BGH-NOS fragment in the 
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orientation shown in Figure 20, was designated as 
plasmid N25-BGH. This plasmid contains a unique Bglll 
cleavage site located about 25 bases from the 5' end 
of the BGH structural sequence. 

5 

Plasmid N25-BGH prepared from dam- E. coli 
cells was digested with Bglll and Clal to yield an 860 
bp fragment which contained the BGH structural 
sequence joined to the NOS 3 1 non- translated region. 

10 Separately, plasmid pMON121 (described previously and 
shown in Figure 17) was prepared from dam- E. coli 
cells and was digested with Clal and Bell to create an 
1100 bp fragment which contained the sbss promoter 
region. The fragments were li gated at their 

15 compatible Bcll/Bglll overhangs, and digested with 

Clal to yield a Clal fragment of about 2 kb containing 
the chimeric sbss-BGH-NOS gene. This fragment was 
inserted into pMON120 (described previously and shown 
in Figure 10) which had been digested with Clal. The 

20 resulting plasmids, containing the inserted chimeric 
gene in opposite orientations were designated pMON147 
and pM0N148, as shown in Figure 21. 

An alternate chimeric BGH gene was created 
25 which contained (1) a NOS promoter region and 5' 

non-translated region, (2) a structural sequence which 
codes for BGH, and (3) a NOS 3' non- translated region, 
by the following method, shown in Figure 22. 

30 Plasmid pMON76 (described above and shown in 

Figure 9) was digested with EcoRI and Bglll to obtain 
a 308 bp fragment containing a NOS promoter region and 
5' non- translated region. Plasmid N25-BGH prepared 
from dam- E. coli cells (described above and shown in 

35 Figure 20) was digested with Bglll and Clal to obtain 
a 900 bp fragment containing a BGH structural sequence 
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and a NOS 3* non- translated region. These two 
fragments were ligated together to obtain a chimeric 
NOS-BGH-NOS gene in a fragment with EcoRI and Clal 
ends. This fragment was ligated with an 8 kb fragment 
5 obtained by digesting pMON120 with EcoRI and Clal. 

The resulting plasmid, designated as pMON149, is shown 
in Figure 22. 

Creation of Chimeric NOS-EPSP-NOS Gene 

10 In an alternate preferred embodiment, a 

chimeric gene was created comprising (1) a NOS 
promoter region and 5' non-translated region, (2) a 
structural sequence which codes for the E. coli 
enzyme, 5-enol pyruvyl shikimate-3 -phosphoric acid 

15 synthase (EPSP synthase) and (3) a NOS 3' 
non- translated region. 

EPSP synthase is believed to be the target 
enzyme for the herbicide, glyphosate, which is 

20 marketed by Monsanto Company under the registered 

trademark, "Roundup." Glyphosate is known to inhibit 
EPSP synthase activity (Amrhein et al, 1980), and 
amplification of the EPSP synthase gene in bacteria is 
known to increase their resistance to glyphosate. 

25 Therefore, increasing the level of EPSP synthase 

activity in plants may confer resistance to glyphosate 
in transformed plants- Since glyphosate is toxic to 
most plants, this provides for a useful method of weed 
control. Seeds of a desired crop plant which has been 

30 transformed to increase EPSP synthase activity may be 
planted in a field. Glyphosate may be applied to the 
field at concentrations which will kill all 
non- transformed plants, leaving the non- trans formed 
plants unharmed. 
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An EPSP synthase gene may be isolated by a 
variety of means, including the following. A lambda 
phage library may be created which carries a variety 
of DNA fragments produced by Hindi I I cleavage of 
5 E. coli DNA. See, e.g., Maniatis et al, 1982. 

The EPSP synthase gene is one of the genes 
which are involved in the production of aromatic amino 
acids. These genes are designated as the "aro" genes; 
10 EPSP synthase is designated as aroA . Cells which do 
not contain functional aro genes are designated as 
aro - cells. Aro - cells must normally be grown on 
media supplemented by aromatic amino acids. See 
Pittard and Wallis, 1966. 

15 

Different lambda phages which carry various 
Hindi I I fragments may be used to infect mutant E. coli 
cells which do not have EPSP synthase genes. The 
infected aro - cells may be cultured on media which 

20 does not contain the aromatic amino acids, and 

transformed aro+ clones which are capable of growing 
on such media may be selected. Such clones are likely 
to contain the EPSP synthase gene. Phage particles 
may be isolated from such clones, and DNA may be 

25 isolated from these phages. The phage DNA may be 
cleaved with one or more restriction endonucleases , 
and by a gradual process of analysis, a fragment which 
contains the EPSP synthase gene may be isolated. 

30 Using a procedure similar to the method 

summarized above, the Applicants isolated an 11 kb 
Hindi I I fragment which contained the entire E. coli 
EPSP synthase gene. This fragment was digested with 
Bglll to produce a 3.5 kb Hindi I I -Bgl I I fragment which 

35 contained the entire EPSP synthase gene. This 3.5 kb 
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fragment was inserted into plasmid pKC7 (Rao and 
Rogers, 1979) to produce plasmid pM0N4 f when is shown 
in Figure 23. 

5 Plasmid pM0N4 was digested with Clal to 

yield a 2.5 kb fragment which contained the EPSP 
synthase structural sequence. This fragment was 
inserted into pBR327 that had been digested with Clal, 
to create pMON8, as shown in Figure 23. 

10 

pMON8 was digested with BamHI and Ndel to 
obtain a 4.9 kb fragment. This fragment lacked about 
200 nucleotides encoding the amino terminus of the 
EPSP synthase structural sequence. 

15 

The missing nucleotides were replaced by 
ligating a Hinfl/Ndel fragment, obtained from pMON8 as 
shown in Figure 24, together with a synthetic 
oligonucleotide sequence containing (1) the EPSP 

20 synthase start codon and the first three nucleotides, 
(2) a unique Bglll site, and (3) the appropriate BamHI 
and Hinfl ends. The resulting plasmid, pMON25, 
contains an intact EPSP synthase structural sequence 
with unique BamHI and Bglll sites positioned near the 

25 start codon. 

Double stranded M-2 DNA (described 
previously and shown in Figure 8) was digested with 
Hindi I I and EcoRI to yield a 290 bp fragment which 

30 contains the NOS 3' non- translated region and 

poly-adenylation signal. This fragment was introduced 
into a pMON25 plasmid that had been digested with 
EcoRI and Hindi I I to create a plasmid, designated as 
PM0N146 (shown in Figure 25) which contains the EPSP 

35 structural sequence joined to the NOS 3" 
non- translated region. 
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PMON146 was cleaved with Clal and Bglll to 
yield a 2.3 kb fragment carrying the EPSP structural 
sequence joined to the NOS 3 1 non- translated region. 
pMON76 (described previously and shown in Figure 9) 
5 was digested with Bglll and EcoRI to create a 310 bp 
fragment containing the NOS promoter region and 5' 
non- translated region. The above fragments were mixed 
with pMON120 (described previously and shown in Figure 
10) that had been digested with Clal and EcoRI, and 
10 the mixture was ligated. The resulting plasmid, 
designated pMON153, is shown in Figure 26. This 
plasmid contains the chimeric NOS-EPSP-NOS gene. 

A plasmid containing a chimeric 

15 sbss-EPSP-NOS gene was prepared in the following 
manner, shown in Figure 27. Plasmid pMON146 
(described previously and shown in Figure 25) was 
digested with Clal and Bglll, and a 2.3 kb fragment 
was purified. This fragment contained the EPSP 

20 synthase structural sequence coupled to a NOS 3' 

non- trans la ted region with a poly-adenylation signal. 
Plasmid pMON121 (described above and shown in Figure 
17) was digested with Clal and Bell, and a 1.1 kb 
fragment was purified. This fragment contains an sbss 

25 promoter region and 5' non-translated region. The two 
fragments were mixed and ligated with T4 DNA ligase 
and subsequently digested with Clal. This created a 
chimeric sbss-EPSP-NOS gene, joined through compatible 
Bglll and Bell termini. This chimeric gene with Clal 

30 termini was inserted into plasmid pMON120 which had 

been digested with Clal and treated with calf alkaline 
phosphatase (CAP). The mixture was ligated with T4 DNA 
ligase. The resulting mixture of fragments and 
plasmids was used to transform E. coli cells, which 

35 were selected for resistance to spectinomycin. A 
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colony of resistant cells was isolated, and the 
plasmid in this colony was designated as pMON154, as 
shown in Figure 27. 

A culture of E. coli containing pM0N154 has 
been deposited with the American Type Culture Center. 
This culture has been assigned accession number 39265. 

Creation of CaMV(19S)-NPT II-NOS Genes 

In an alternate preferred embodiment of this 
invention, a chimeric gene was created which contained 
the following elements: 

1. a promoter region and a 5* non- translated 
region derived from the CaMV (19S) gene, which codes 
for the P66 protein; 

2. a partial coding sequence from the CaMV 
(19S) gene, including an ATG start codon and several 
internal ATG sequences, all of which were in the same 
frame as a TGA termination sequence immediately inside 
the desired ATG start codon of the NPTII gene; 

3. a structural sequence derived from a 
neomycin phosphotransferase II (NPT II) gene; this 
sequence was preceded by a spurious ATG sequence, 
which was in the same reading frame as a TGA sequence 
within the NPT II structural sequence; and, 

4. A 3' non- trans la ted region, including a 
poly-adenylation signal, derived from a nopaline 
synthase (NOS) gene. 

This chimeric gene, referred to herein as 
the CaMV(19S) -NPTII -NOS gene, was inserted into 
plasmid pMON120 to create a plasmid designated as 
PMON156, shown in Figure 29 and described in Example 
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9. Plasmid pMON156 was inserted in A. tumefaciens 
cells, where it formed a co-integrate Ti plasmid by- 
means of a single crossover event with a Ti plasmid in 
the A. tumefaciens cell. The chimeric gene in the 
5 co-integrate plasmid was within a modified T-DNA 

region in the Ti plasmid, surrounded by left and right 
T-DNA borders. 

A similar chimeric gene was created and 
assembled in a plasmid designated as pMON155, shown in 
10 Figure 32 and described in Example 10. This chimeric 
gene resembled the gene in pMON156, with two 
exceptions : 

1. an oligonucleotide linker having stop 
codons in all three reading frames was inserted 

15 between the CaMV(19S) partial structural sequence and 
the NPT II structural sequence; and, 

2. the spurious ATG sequence on the 5' side 
of the NPT II structural sequence was deleted. 

The construction of this chimeric gene is 
20 described in Example 10. This gene was inserted into 
A. tumefaciens cells and subsequently into plant 
cells. Its level of expression was apparently higher 
than the expression of the similar gene in pMON156, as 
assayed by growth on higher concentrations of 
25 kanamycin. A. tumefaciens cells containing 

co-integrate Ti::pMON155 plasmids have been deposited 
with the American Type Culture Center, and have been 
assigned ATCC accession number 39336. 

Creation of Chimeric CaMV(32S )-NPT II-NOS Genes 

30 In an alternate preferred embodiment of this 

invention, a chimeric gene was created comprising 
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5 



(1) a promoter region which causes 
transcription of the 32S CaMV mRNA; 

(2) a structural sequence which codes for 
NPT II; and 

(3) a NOS 3' non-translated region. 



The assembly of this chimeric gene is 



described in Example 11 and Figures 33 through 37. 
This gene was inserted into plant cells and it caused 
them to become resistant to kanamycin. 



by CaMV . Those skilled in the art may determine 
through routine experimentation whether any particular 
plant viral promoter (such as the CaMV promoter) will 
function at satisfactory levels in any particular type 
15 of plant cell, including plant cells that are outside 
of the normal host range of the virus from which the 
promoter was derived. 

Means for Inserting Chimeric Genes Into Plant Cells 



20 foreign DNA into plant cells. One such method, 
utilized by the Applicants, involved inserting a 
chimeric gene into Ti plasmids carried by 
A. tumefaciens , and co-cultivating the A. tumefaciens 
cells with plants. A segment of T-DNA carrying the 

25 chimeric gene was transferred into the plant genome, 
causing transformation. This method is described in 
detail in two separate U.S. patent applications 
entitled "Plasmids for Transforming Plant Cells," 
serial number 458,411, and "Genetically Transformed 

30 Plants," serial number 458,402, both of which were 
filed on January 17, 1983. 



10 



Petunia plants cannot normally be infected 



A variety of methods are known for inserting 
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A variety of other methods are listed below. 
These methods are theoretically capable of inserting 
the- chimeric genes of this invention into plant cells, 
although the reported transformation efficiencies 
5 achieved to date by such methods have been low. The 
chimeric genes of this invention (especially those 
chimeric genes such as NPT I and NPT II, which may be 
utilized as selectable markers) are likely to 
facilitate research on methods of inserting DNA into 
10 plants or plant cells. 

1. One alternate technique for inserting DNA 
into plant cells involves the use of lipid vesicles, 
also called liposomes. Liposomes may be utilized to 
15 encapsulate one or more DNA molecules. The liposomes 
and their DNA contents may be taken up by plant cells? 
see, e.g., Lurguin, 1981. If the inserted DNA can be 
incorporated into the plant genome, replicated, and 
inherited, the plant cells will be transformed. 

20 

To date, efforts to use liposomes to deliver 
DNA into plant cells have not met with great success 
(Fraley and Papahadjopoulos, 1981). Only relatively 
small DNA molecules have been transferred into plant 

25 cells by means of liposomes, and none have yet been 
expressed. However, liposome-delivery technology is 
still being actively developed, and it is likely that 
methods will be developed for transferring plasmids 
containing the chimeric genes of this invention into 

30 plant cells by means involving liposomes. 



2. Other alternate techniques involve 
contacting plant cells with DNA which is complexed 
with either (a) polycationic substances, such as 
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poly-L-orni thine (Davey et al, 1980), or (b) calcium 
phosphate (Krens et al, 1982). Although efficiencies 
of transformation achieved to date have been low, 
these methods are still being actively researched. 

5 

3. A method has been developed involving 
the fusion of bacteria, which contain desired 
plasmids, with plant cells. Such methods involve 
converting the bacteria into spheroplasts and 

10 converting ' the plant cells into protoplasts. Both of 
these methods remove the cell wall barrier from the 
bacterial and plant cells, using enzymic digestion. 
The two cell types can then be fused together by 
exposure to chemical agents, such as polyethylene 

15 glycol. See Hasezawa et al, 1981. Although the 

transformation efficiencies achieved to date by this 
method have been low, similar experiments using 
fusions of bacterial and animal cells have produced 
good results; see Rassoulzadegan et al, 1982. 

20 

4. Two other methods which have been used 
successfully to genetically transform animal cells 
involve (a) direct microinjection of DNA into animal 
cells, using very small glass needles (Capecchi, 

25 1980), and (b) electric-current-induced uptake of DNA 
by animal cells (Wong and Neumann, 1982). Although 
neither of these techniques have been utilized to date 
to transform plant cells, they may be useful to insert 
chimeric genes of this invention into plant cells. 

30 

Use of Chimeric Genes to Identify Plant Regulators 

The chimeric genes of this invention may be 
used to identify, isolate, and study DNA sequences to 
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determine whether they are capable of promoting or 
otherwise regulating the expression of genes within 
plant cells. 

5 For example, the DNA from any type of cell 

can be fragmented, using partial endonuclease 
digestion or other methods. The DNA fragments are 
mixed with multiple copies of a chimeric gene which 
has been cleaved at a unique cleavage site that is 

10 located in the 5' direction from the ATG start codon 

of the structural sequence. Preferably, the structural 
sequence, if properly transcribed, will be translated 
into a selectable marker, such as a polypeptide which 
confers resistance on the host to a selected 

15 antibiotic. The DNA mixture is ligated to form 

plasmids, and the plasmids are used to transform plant 
cells which are sensitive to the selected antibiotic. 
The cells are cultured on media which contains an 
appropriate concentration of the selected antibiotic. 

20 Plant cells will survive and reproduce only if the 

structural sequence is transcribed and translated into 
the polypeptide which confers resistance to the 
antibiotic. This is presumed to occur only if the 
inserted DNA fragment performs the function of a gene 

25 promoter; the resistant colonies will be evaluated 
further to determine whether this is the case. 

Using this technique, it is possible to 
evaluate the promoter regions of bacteria, yeast, 
30 fungus, algae, other microorganisms, and animal cells, 
to determine whether they also function as gene 
promoters in various types of plant cells. It is also 
possible to evaluate promoters from one type of plant 
in other types of plant cells. By using similar 
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methods and varying the cleavage site in the chimeric 
gene, it is possible to evaluate the performance of 
any DNA sequence as a 5 1 non- translated region, a 3 * 
non- translated region, or any type of other regulatory 



utilized in this method of evaluating the regulatory 
effects of various DNA sequences. For example, the 
10 NOS promoter region and/or the NOS 5* non-translated 

region may be deleted from the NOS-NPT I I -NOS chimeric 
gene. This would create a chimeric gene having a 
unique cleavage site but no promoter region in front 
of an NPT II structural sequence. 

15 In case the inserted DNA fragment contains a 

start codon which might (1) alter the reading frame of 
the structural sequence, or (2) alter the amino 
terminus of the polypeptide, it is possible to place 
an oligonucleotide between the cleavage site and the 

20 start codon of the structural sequence. The 

oligonucleotide would contain stop codons in all three 
reading frames. Therefore, if a start codon was 
included in the inserted DNA fragment, the gene would 
be a dicistronic gene. The first polypeptide would be 

25 terminated by whichever stop codon happened to be in 
the reading frame of the inserted start codon. The 
second start codon would begin the translation of a 
separate polypeptide, which would be the selectable 
marker enzyme. 



A variety of phrases which are used in the 
claims must be defined and described to clarify the 
meaning and coverage of the claims. 



5 



sequence . 



If desired, a partial chimeric gene may be 



30 



Meaning of Various Phrases 
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The meaning of any particular term shall be 
interpreted with reference to the text and figures of 
this application. In particular, it is recognized 
that a variety of terms have developed which are used 
5 inconsistently in the literature. For example, a 
variety of meanings have evolved for the term 
"promoter," some of which include the 5 1 
non-translated region and some of which do not. In an 
effort to avoid problems of interpretation, the 

10 Applicants have attempted to define various terms. 

However, such definitions are not presumed or intended 
to be comprehensive and they shall be interpreted in 
light of the relevant literature. 

The term "chimeric gene" refers to a gene 

15 that contains at least two portions that were derived 
from different and distinct genes. As used herein, 
this term is limited to genes which have been 
assembled, synthesized, or otherwise produced as a 
result of man-made efforts, and any genes which are 

20 replicated or otherwise derived therefrom. "Man-made 
efforts" include enzymatic, cellular, and other 
biological processes, if such processes occur under 
conditions which are caused, enhanced, or controlled 
by human effort or intervention; this excludes genes 

25 which are created solely by natural processes. 

As used herein, a "gene" is limited to a 
segment of DNA which is normally regarded as a gene by 
those skilled in the art. For example, a plasmid 

30 might contain a plant-derived promoter region and a 

heterologous structural sequence, but unless those two 
segments are positioned with respect to each other in 
the plasmid such that the promoter region causes the 
transcription of the structural sequence, then those 

35 two segments would not be regarded as included in the 
same gene. 
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This invention relates to chimeric genes 
which, have structural sequences that are 
"heterologous" with respect to their promoter regions. 
This includes at least two types of chimeric genes: 

5 

1. DNA of a gene which is foreign to a 
plant cell. For example, if a structural sequence 
which codes for mammalian protein or bacterial protein 
is coupled to a plant promoter region, such a gene 

10 would be regarded as heterologous. 

2. A plant cell gene which is naturally 
promoted by a different plant promoter region. For 
example, if a structural sequence which codes for a 
plant protein is normally controlled by a low-quantity 

15 promoter, the structural sequence may be coupled with 
a prolific promoter. This might cause a higher 
quantity of transcription of the structural sequence, 
thereby leading to plants with higher protein content. 
Such a structural sequence would be regarded as 

20 heterologous with regard to the prolific promoter. 

However, it is not essential for this 
invention that the entire structural sequence be 
heterologous with respect to the entire promoter 

25 region. For example, a chimeric gene of this 

invention may be created which would be translated 
into a "fusion protein", i.e., a protein comprising 
polypeptide portions derived from two separate 
structural sequences. This may be accomplished by 

30 inserting all or part of a heterologous structural 

sequence into the structural sequence of a plant gene, 
somewhere after the start codon of the plant 
structural sequence. 
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As used herein, the phrase, "a promoter 
region derived from a specified gene" shall include a 
promoter region if one or more parts of the promoter 
5 region were derived from the specified gene. For 
example, it might be discovered that one or more 
portions of a particular plant-derived promoter region 
(such as intervening region 8, shown on Figure 1) 
might be replaced by one or more sequences derived 

10 from a different gene, such as the gene that contains 
the heterologous structural sequence, without reducing 
the expression of the resulting chimeric gene in a 
particular type of host cell. Such a chimeric gene 
would contain a plant-derived association region 2, 

15 intervening region 4, and transcription initiation 
sequence 6, followed by heterologous intervening 
region 8, 5' non- translated region 10 and structural 
sequence 14. Such a chimeric gene is within the scope 
of this invention. 

20 

As used herein, the phrase "derived from" 
shall be construed broadly. For example, a structural 
sequence may be "derived from" a particular gene by a 
variety of processes, including the following: 

25 

1. the gene may be reproduced by various 
means such as inserting it into a plasmid and 
replicating the plasmid by cell culturing, in vitro 
replication, or other methods, and the desired 
30 sequence may be obtained from the DNA copies by 
various means such as endonuclease digestion; 



2. mRNA which was coded for by the gene may 
be obtained and processed in various ways, such as 
35 preparing complementary DNA from the mRNA and then 
digesting the cDNA with endonucleases ; 
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3. the sequence of bases in the structural 
sequence may be determined by various methods, such as 
endbnuclease mapping or the Maxam-Gilbert method. A 
strand of DNA which duplicates or approximates the 

5 desired sequence may be created by various methods, 
such as chemical synthesis or ligation of 
oligonucleotide fragments . 

4. a structural sequence of bases may be 
10 deduced by applying the genetic code to the sequence 

of amino acid residues in a polypeptide. Usually, a 
variety of DNA structural sequences may be determined 
for any polypeptide, because of the redundancy of the 
genetic code. From this variety, a desired sequence 
15 of bases may be selected, and a strand of DNA having 
the selected sequence may be created. 

If desired, any DNA sequence may be modified 
by substituting certain bases for the existing bases. 

20 Such modifications may be performed for a variety of 

reasons. For example, one or more bases in a sequence 
may be replaced by other bases in order to create or 
delete a cleavage site for a particular endonuclease . 
As another example, one or more bases in a sequence 

25 may be replaced in order to reduce the occurrence of 
"stem and loop" structures in messenger RNA. Such 
modified sequences are within the scope of this 
invention. 

30 A structural sequence may contain introns 

and exons; such a structural sequence may be derived 
from DNA, or from an mRNA primary transcript. 
Alternately, a structural sequence may be derived from 
processed mRNA, from which one or more introns have 

35 been deleted. 



¥0 84/02913 



PCT/US84/00048 



-59- 

The Applicants have deposited two cultures 
of E. coli cells containing plasmids pMON128 and 
pM0N154 with the American Type Culture Collection 
(ATCC). These cells have been assigned ATCC accession 
5 numbers 39264 and 39265, respectively. The Applicants 
have claimed cultures of microorganisms having the 
"relevant characteristics" of either culture. As used 
herein, the "relevant characteristics" of a cell 
culture are limited to those characteristics which 

10 make the culture suitable for a use which is 
disclosed, suggested or made possible by the 
information contained herein. Numerous 
characteristics of the culture may be modified by 
techniques known to those skilled in the art; for 

15 example, the cells may be made resistant to a 

particular antibiotic by insertion of a particular 
plasmid or gene into the cells, or the pMON128 or 
pMON154 plasmids might be removed from the designated 
cells and inserted into a different strain of cells. 

20 Such variations are within the scope of this 
invention, even though they may amount to 
improvements, which undoubtedly will occur after more 
researchers gain access to these cell cultures. 

25 Those skilled in the art will recognize, or 

be able to ascertain using no more than routine 
experimentation, numerous equivalents to the specific 
embodiments described herein. Such equivalents are 
within the scope of this invention. 

30 

EXAMPLES 

Example 1 Creation of pMONlOOl 
35 Fifty micrograms (ug) of lambda phage 
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bbkan-1 DNA (Berg et al, 1975) were digested with 100 
vmi ts of Hindi I I (all restriction endonucleases were 
obtained from New England Biolabs, Beverly, MA, and 
were used with buffers according to the suppliers 
5 instructions, unless otherwise specified) for 2 hr at 
37°C. After heat-inactivation (70°C, 10 min), the 3.3 
kb Tn5 Hindi I I fragment was purified on a sucrose 
gradient. One ug of the purified Hindi I I fragment was 
digested with BamHI (2 units, 1 hr, 37°C), to create, a 
10 1.8 kb fragment. The endonuclease was heat 
inactivated. 



were heat inactivated and the cleaved pBR327 DNA was 
added to the BamHI -Hindi I I Tn5 fragments. After 
addition of ATP to a concentration of 0.75mM, 10 units 
of T4 DNA ligase (prepared by the method of Murray et 
20 al, 1979) was added, and the reaction was allowed to 
continue for 16 hours at 12-14°C. One unit of T4 DNA 
ligase will give 90% circularization of one ug of 
Hindll I -cleaved pBR327 plasmid in 5 minutes at 22°C. 

25 The ligated DNA was used to transform 

CaCl 2 -shocked E. coli C600 recA 56 cells (Maniatis et 
al, 1982). After expression in Luria broth (LB) for 1 
hour at 37°C the cells were spread on solid LB media 
plates containing 200 ug/ml ampicillin and 40 ug/ml 

30 kanamycin. Following 16 hour incubation at 37°C, * 
several hundred colonies appeared. Plasmid mini-prep 
DNA was prepared from six of these. (Ish-Horowicz and * 
Burke, 1981). Endonuclease digestion showed that all 
six of the plasmids carried the 1.8 kb Hindlll-BamHI 

35 fragment. One of those isolates was designated as 
pMONlOOl as shown in Figure 6. 



15 



Plasmid pBR327 (Soberon et al, 1981), 1 ug, 
was digested with Hindi I I and BamHI ( 2 units each, 2 
hr, 37°C). Following digestion, the endonucleases 
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Example 2; Creation of PMON40 

Five ug of plasmid pMONlOOl (described in 
Example 1) was digested with Smal. The reaction was 
5 terminated by phenol extraction, and the DNA was 

precipitated by ethanol. A BamHI linker CCGGATCCGG 
(0.1 ug), which had been phosphorylated with ATP and 
T4 polynucleotide kinase (Bethesda Research 
Laboratory, Rockville, MD) was added to 1 ug of the 

10 pMONlOOl fragment- The mixture was treated with T4 
DNA ligase (100 units) for 18 hours at 14°C. After 
heating at 70°C for 10 min to inactivate the DNA 
ligase, the DNA mixture was digested with BamHI 
endonuclease (20 units, 3 hours, 37°C) and separated 

15 by electrophoresis on an 0.5% agarose gel. The band 

corresponding to the 4.2 kb Smal-BamHI vector fragment 
was excised from the gel. The 4.2 kb fragment was 
purified by adsorption on glass beads (Vogelstein and 
Gillespie, 1979), ethanol precipitated and resuspended 

20 in 20 ul of DNA ligase buffer with ATP. T4 DNA ligase 
(20 units) was added and the mixture was incubated for 
1.5 hours at room temperature. The DNA was mixed with 
rubidium chloride- shocked E. coli C600 cells for 
DNA transformation (Maniatis et al, 1982). After 

25 expression for 1 hour at 37°C in LB, the cells were 

spread on LB plates containing 200 ug/ml of ampicillin 
and 20 ug/ml kanamycin. The plates were incubated at 
37°C for 16 hours. Twelve ampicillin-resistant, 
kanamycin-resistant colonies were chosen, 2 ml 

30 cultures were grown, and mini-plasmid preparations 

were performed. Endonuclease mapping of the plasmids 
revealed that ten of the twelve contained no Smal site 
and a single BamHI site, and were of the appropriate 
size, 4.2 kb. The plasmid from one of the ten 

35 colonies was designated as pMON40, as shown in Figure 
6. 



WO 84/02913 



PCT/US84/00048 



-62- 

Example 3: Creation, of NOS Promoter Fragment 

An oligonucleotide with the following 
sequence, 5 1 -TGCAGATTATTTGG-3 ' , was synthesized 
5 (Beaucage and Carruthers, 1981, as modified by Adams 

32 

et al, 1982). This oligonucleotide contained a P 
radioactive label , which was added to the 5 ' thymidine 
residue by polynucleotide kinase. 

10 An M13 mp7 derivative r designated as S1A, 

was given to Applicants by M. Bevan and M.-D. Chilton, 
Washington University, St. Louis, MO. To the best of 
Applicants' knowledge and belief, the SlA DNA was 
obtained by the following method. A pTiT37 plasmid 

15 was digested with Hindlll, and a 3.4 kb fragment was 
isolated and designated as the Hindi I 1-23 fragment. 
This fragment was digested with Sau3a, to create a 344 
bp fragment with Sau3a ends. This fragment was 
inserted into double-stranded, replicative form DNA 

20 from the M13 mp7 phage vector (Messing et al, 1981) 

which had been cut with BamHI. Two recombinant phages 
with 344 bp inserts resulted, one of which contained 
the anti- sense strand of the NOS promoter fragment. 
That recombinant phage was designated as SlA, and a 

25 clonal copy was given to the Applicants. 

The Applicants prepared the single-stranded 
form of the SlA DNA (14.4 ug; 6 pmol), and annealed it 
(10 minutes at 70°C, then cooled to room temperature) 
30 with 20 pmol of the 14-mer oligonucleotide, mentioned 
above. The oligonucleotide annealed to the Sau3a 
insert at bases 286-300 as shown on Figures 4 and 5. 

200 ul of the SlA template and annealed 
35 oligonucleotide were mixed with the four dNTP's 
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10 



15 



20 



25 



(present at a final concentration of ImM, 25 ul) and 
50 ul of Klenow polymerase. The mixture incubated for 
30 minutes at room temperature. During this period, 
the polymerase added dNTP's to the 3' end of the 
oligonucleogide . The polymerase was heat- inactivated 
(70°C, 3 min), and Haelll (160 units) were added. The 
mixture was incubated (1 hour, 55°C), the Haelll was 
inactivated (70°C, 3 min), and the four dNTP's (ImM, 
12 ul) and T4 DNA polymerase (50 units) were added. 
The mixture was incubated (1 hour, 37°C) and the 
polymerase was inactivated (70°C, 3 min). This 
yielded a fragment of about 570 bp. EcoRI (150 units) 
was added, the mixture was incubated (1 hour, 37°C) 
and the EcoRI was inactivated (70°C, 3 min). 

Aliquots of the mixture were separated on 6% 
acrylamide with 25% glycerol. Autoradiography 
revealed a radioactively labelled band about 310 bp in 
size. This band was excised. The foregoing procedure 
is indicated by Figure 5. 

Example 4; Creation of pMON58 

Five ug of plasmid pMON40 (described in 
Example 2) were digested with Bglll (10 units, 1.5 
hour, 37°C), and the Bglll was inactivated (70°C, 10 
min). The four dNTP's (ImM, 5 ul) and Klenow 
polymerase (8 units) were added, the mixture was 
incubated (37°C, 40 min), and the polymerase was 
inactivated (70°C, 10 min). EcoRI (10 units) was 
added and incubated (1 hour, 37°C), and calf alkaline 
phosphatase (CAP) was added and incubated (1 hour, 
37°C). A fragment of about 3.9 kb was purified on 
agarose gel using NA-45 membrane (Scheicher and 
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Scheull, Keene NH). The fragment (1.0 pM) was mixed 
with the NOS promoter fragment (0.1 pM), described in 
Example 3, and with T4 DNA ligase (100 units). The 
mixture was incubated (4°C, 16 hr). The resulting 
5 plasmids were inserted into E. coli cells, which were 
selected on media containing 200 ug/ml ampicillin. 
Thirty-six clonal Amp R colonies were selected, and 
mini -preps of plasmids were made from those colonies. 
The plasmid from one colony demonstrated a 308 bp 
10 EcoRI-Bglll fragment, a new Sstll cleavage site 

carried by the 308 bp NOS fragment, and a new PstI 
site. This plasmid was designated as pMON58, as 
shown in Figure 7. pMON58 DNA was prepared as 
described above. 

15 Example 5: Creation of pMON42 



plasmid pBR325 (Bolivar, 1978) carrying the HindIII-23 
fragment of pTIT37 (see Figure 3) in the Hindlll site, 

20 was given to Applicants by M. Bevan and M.-D. Chilton, 
Washington University, St. Louis, MO. DNA of this 
plasmid was prepared and 30 ug were digested with 
Hindi ri (50 units) and BaraHI (50 units). The 1.1 kb 
Hindlll-BamHI fragment- was purified by adsorption on 

25 glass beads (Vogelstein and Gillespie, 1979) after 
agarose gel electrophoresis. The purified fragment 
(0.5 ug) was added to 0.5 ug of the 2.9 kb 
Hindi II-BamHI fragment of pBR327. After treatment with 
DNA ligase (20 units, 4 hours, 22°C), the resulting 

30 plasmids were introduced to E. coli C600 cells. Clones 
resistant to ampicillin at 200 ug/ml were selected on 
solid media; 220 clones were obtained. Minipreps of 
plasmid DNA were made from six of these clones and 



Plasmid pBR325-HindIII-23, a derivative of 
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tested with the presence of a 1.1 kb fragment after 
digestion with Hindi I I and BamHI. One plasmid which 
demonstrated the correct insert was designated pMON42. 
Plasmid pMON42 DNA was prepared as described in 
5 previous examples. 

Example 6; Creation of M13 Clone M-2 



in Example 5) prepared from dam- E;. coli cells were 

10 digested with Rsal and BamHI (50 units of each, 3 

hours, 37oC) and the 720 bp Rsal-BamHI fragment was 
purified using NA-45 membrane. Eight ug of the 
purified 720 bp BamHI -Rsal fragment were digested with 
Mbol (10 min, 70°C), the ends were made blunt by 

15 filling in with the large Klenow fragment of DNA 

polymerase I and the four dNTP's. Then 0.1 ug of the 
resulting DNA mixture was added to 0.05 ug of M13 mp8 
previously digested with Smal (1 unit, 1 hour 37°C) 
and calf alkaline phosphatase (0.2 units). After 

20 ligation (10 units of T4 DNA ligase, 16 hours, 12°C) 
and transfection of E. coli JM101 cells, several 
hundred recombinant phage were obtained. Duplex RF 
DNA was prepared from twelve recombinant, 
phage-carrying clones. The RF DNA (0.1 ug) was 

25 cleaved with EcoRI, (1 unit, 1 hour, 37°C), 

end-labeled with 32 P-dATP and Klenow polymerase, and 
re-digested with BamHI (1 unit, 1 hour, 37°C). The 
EcoRI and BamHI sites span the Smal site. Therefore, 
clones containing the 260 bp Mbol fragment could be 

30 identified as yielding a labelled 270 bp fragment 

after electrophoresis on 6% poly- acryl amide gels and 
autoradiography. Four of the twelve clones carried 
this fragment. The orientation of the insert was 
determined by digestion of the EcoRI -cleaved, 



Seventy- five ug of plasmid pMON42 (described 




WO 84/02913 



PCT/US84/00048 



-66- 

end-labeled RF DNA (0.1 ug) with Hinfl (1 unit, 1 
hour, 37°C). Hinfl cleaves the 260 bp Mbol fragment 
once 99 bp from the 3' end of the fragment and again 
42 bp from the end nearest the NOS coding region. Two 
5 clones of each orientation were obtained. One clone, 
digested as M-2 as shown in Figure 8, contained the 
260 bp fragment with the EcoRI site at the 3 1 end of 
the fragment. M-2 RF DNA was prepared using the 
procedures of Messing, et al 1981. 

10 

Example 7; Creation of pMON75 and pMON76 

Fifty ug of M-2 RF DNA (described in Example 
6) were digested with 50 units of EcoRI and 50 units 

15 of BamHI for 2 hours at 37°C. The 270 bp fragment (1 
ug) was purified using agarose gel and NA-45 membrane. 
Plasmid pMON58 (described in Example 4) was digested 
with EcoRI and BamHI (50 ug, 50 units each, 2 hours, 
37°C) and the 1300 bp fragment was purified using 

20 NA-45 membrane. The 270 bp EcoRI -BamHI (0.1 ug) and 
13-00 bp EcoRI -BamHI (0.5 ug) fragments were mixed, 
treated with T4 DNA ligase (2 units) for 12 hours at 
14°C. After heating at 70°C for 10 minutes to 
inactivate the ligase, the mixture was treated with 

25 EcoRI (10 units) for 1 hour at 37°C, then heated to 
70°C for 10 minutes to inactivate the EcoRI. This 
completed the assembly of a chimeric NOS-NPT II-NOS 
gene on a 1.6 kb fragment, as shown on Figure 9. 

30 Plasmid pMON38 is a clone of the pTiT37 

Hindi I I -23 fragment inserted in the Hindi I I site of 
pBR327 (Soberon et al, 1980). pMON38 DNA (20 ug) was 
digested with EcoRI (20 units, 2 hours, 37°C) and calf 
alkaline phosphatase (0.2 units, 1 hour, 37°C) . The 
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pMON38 DNA reaction was extracted with phenol, 
precipitated with ethanol, dried and resuspended in 20 
ul of 10 mM Tris-HCl, 1 mM EDTA, pH 8. 

0.2 ug of the cleaved pMON38 DNA was added 
5 to the chimeric gene mixture described above. The 
mixture was treated with T4 DNA ligase (4 units, 1 
hour, 22°C) and mixed with Rb chloride-treated E. coli 
C600 recA 56 cells to obtain transformation. After 
plating with selection for ampicillin- resistant (200 

10 ug/ml) colonies, 63 potential candidates were 

obtained. Alkaline mini -preps of plasmid DNA were 
made from 12 of these and screened by restriction 
endonuclease digestion for the proper constructs. 
Plasmid DNA 1 s that contained a 1.5 kb EcoRI fragment 

15 and a new Bglll site were digested with BamHI to 
determine the orientation of the 1.5 kb EcoRI 
fragment. One of each insert orientation was picked. 
One plasmid was designated pMON75 and the other 
pMON76, as shown in Figure 9. DNA from these plasmids 

20 were prepared as described in previous examples. 

Example 8; Creation of plasmids pMON128 and PMON129 

The 1.5 kb EcoRI fragment was excised by 
25 EcoRI digestion from either pMON75 or pMON76 and 
purified after agarose gel electrophoresis as 
described in previous examples. Five ug of DNA from 
plasmid pMON120 (described in a separate application, 
"Plasmids for Transforming Plant Cells," cited 
30 previously) was digested with EcoRI and treated 
with calf alkaline phosphatase. After phenol 
deproteinization and ethanol precipitation, the 
EcoRI-cleaved pMON120 linear DNA was mixed with 0.5 ug 
of the 1.5 kb EcoRI chimeric gene fragment. The 
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mixture was treated with 2 units of T4 DNA ligase for 
1 hour at 22oC. After transformation of E. coli cells 
(Maniatis et al, 1982) and selection of colonies 
resistant to spectinomycin (50 ug/ml), several 
5 thousand colonies appeared. Six of these were picked, 
grown, and plasmid mini-preps made. The plasmid DNA 1 s 
were digested with EcoRI to check for the 1.5 kb 
chimeric gene insert and with BamHI to determine the 
orientation of the insert. BamHI digestion showed 

10 that in pMON128 the chimeric gene was transcribed in 
the same direction as the intact nopaline synthase 
gene of pMON120. The orientation of the insert in 
PMON129 was opposite that in pMON128; the appearance 
of an additional 1.5 kb BamHI fragment in digests of 

15 pMON129 showed that plasmid pMON129 carried a tandem 
duplication of the chimeric NOS-NPT II-NOS gene, as 
shown in Figure 10. 

Example 9: Creation of Plasmid pMON156 



20 gift to Monsanto Company from Dr. R. J. Shepherd, 
University of California, Davis. To the best of 
Applicants' knowledge and belief, these plasmids 
(designated as pOSl) were obtained by inserting the 
entire genome of a CaMV strain designated as CM4-184 
25 (Howarth et 1, 1981) into the Sal I restriction site 
of a pBR322 plasmid (Bolivar et al, 1977). E. coli 
cells transformed with pOSl were resistant to 
ampicillin (Amp R ) and sensitive to tetracycline 



isolation of CaMV DNA which can be used in this 
invention are publicly available; see, e.g., ATCC 
Catalogue of Strains II, p. 387 (3rd edition, 1981). 



Plasmids which contained CaMV DNA were a 



(Tet S ). 



30 



Various strains of CaMV suitable for 
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pOSl DNA was cleaved with Hindlll. Three 
small fragments were purified after electrophoresis on 
an 0.8% agarose gel using NA-45 membrane (Schleicher 
and Schuell, Keene NH). The smallest fragment, about 
5 500 bp in size, contains the 19S promoter- This 

fragment was further purified on a 6% acrylamide gel. 
After various manipulations which did not change the 
sequence of this fragment (shown in Figure 28), it was 
digested with Mbol to create a 455 bp Hindlll-Mbol 

10 fragment. This fragment was mixed with a 1250 bp 
fragment obtained by digesting pM0N75 (described 
in Example 7 and shown in Figure 9) with Bglll and 
EcoRI. This fragment contains the NPTII structural 
sequence and the NOS 3' non- translated region. The 

15 two fragments were ligated together by their compatible 
Mbol and Bglll overhangs to create a fragment containing 
the CaMV(19S) -NPTII -NOS chimeric gene. This fragment was 
inserted into pMON120 which had been cleaved with 
Hindi I I and EcoRI. The resulting plasmid was 

20 designated as pMON156, as shown in Figure 29. 

Plasmid pMON156 was inserted into E. coli 
cells and subsequently into A. tumefaciens cells where 
it formed a co-integrate Ti plasmid having the 
CaMV(19S)-NPT II-NOS chimeric gene surrounded by T-DNA 

25 borders. A. tumefaciens cells containing the 

co-integrate plasmids were co-cultivated with petunia 
cells. The co-cultivated petunia cells were cultured 
on media containing kanamycin. Come of the 
co-cultivated petunia cells survived and produced 

30 colonies on media containing up to 50 ug/ml kanamycin. 
This indicated that the CaMV(19S)-NPT II-NOS genes 
were expressed in petunia cells. These results were 
confirmed by Southern blot analysis of transformed 
plant cell DNA. 
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Example 10; Creation of PM0N155 

Plasmid pMON72 was obtained by inserting a 
1.8 kb Hindlll-BamHI fragment from bacterial 
transposon Tn5 (which contains an NPTII structural 
5 sequence) into a PstI- pBR327 plasmid digested with 
Hindi I I and BamHI. This plasmid was digested with 
Bglll and PstI to remove the NPTII structural 
sequence . 

Plasmid pMONlOOl (described in Example 1 and 
shown in Figure 6) from dam- cells was digested with 
Bglll and PstI to obtain a 218 bp fragment with a 
partial NPTII structural sequence. This fragment was 
digested with Mbol to obtain a 194 bp fragment. 

A triple ligation was performed using (a) 
the iarge Pstl-Bglll fragment of pMON72; (b) the 
Pstl-Mbol fragment from pMONlOOl; and (c) a synthetic 
linker with Bglll and Mbol ends having stop codons in 
all three reading frames. After transformation of 
E. coli cells and selection for ampicillin resistant 
colonies, plasmid DNA from Amp R colonies was analyzed. 
A colony containing a plasmid with the desired structure 
was identified. This plasmid was designated pMONHO, 
as shown on Figure 30. 

In order to add the 3' end of the NPT II 
25 structural sequence to the 5' portion in pMONHO, 
pMONHO was treated with Xhol. The resulting 
overhanging end was filled in to create a blunt end by 
treatment with Klenow polymerase and the four 
deoxy-nucleotide triphosphates (dNTP's), A, T, C, and 
30 G. The Klenow polymerase was inactivated by heat, the 
fragment was digested with PstI, and a 3.6 kb fragment 




15 
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was purified. Plasmid pMON76 (described in Example 7 
and shown in Figure 9) was digested with Hindlll, filled 
in to create a blunt end with Klenow polymerase and the 
four dNTP's, and digested with Pstl. An 1100 bp fragment 
5 was purified, which contained part of the NPT II 

structural sequence, and a nopaline synthase (NOS) 3' 
non- translated region. This fragment was ligated with 
the 3.6 kb fragment from pMONHO. The mixture was 
used to transform E. coli cells; Amp R cells were 

10 selected, and a colony having a plasmid with the 

desired structure was identified. This plasmid was 
designated pMON132, as shown on Figure 31. Plasmid 
pMON93 (shown on Figure 28) was digested with Hindi II, 
and a 476 bp fragment was isolated. This fragment was 

15 digested with Mbol, and a 455 bp fragment was purified 
which contained the CaMV (19S) promoter region and 5 f 
non-translated region. Plasmid pMON132 was digested 
with EcoRI and Bglll to obtain a 1250 bp fragment with 
(1) the synthetic linker equipped with stop codons in 

20 all three reading frames; (2) the NPT II structural 
sequence; and (3) the NOS 3' non- translated region. 
These two fragments were joined together through the 
compatible Mbol and Bglll ends to create a CaMV 
(19S)-NPT I I -NOS chimeric gene. 

25 This gene was inserted into pMON120, which 

was digested with Hindi I I and EcoRI, to create plasmid 
pMON155, as shown in Figure 32. 

Plasmid pMON155 was inserted into 
A. tumefaciens GV3111 cells containing a Ti plasmid, 
30 pTiB653. The pMON155 plasmid formed a cointegrate 
plasmid with the Ti plasmid by means of a single 
crossover event. Cells which contain this 
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co-integrate plasmid have been deposited with the 
American Type Culture Collection, and have been assigned 
ATCC accession number 39336. A fragment which 
contains the chimeric gene of this invention can be 
5 obtained by digesting the co-integrate plasmid with 
Hindi I I and EcoRI, and purifying the 1.7 kb fragment. 
These cells have been used to transform petunia cells, 
allowing the petunia cells to grow on media containing . 
at least 100 ug/ml kanamycin. 

10 Example 11 ; Creation of pMON183 and 184 

Plasmid pOSl (described in Example 9) was 
digested with Bglll, and 1200 bp fragment was 
purified. This fragment contained the 32S promoter 
region and part of the 5' non- translated region. It 
15 was inserted into plasmid pSHL72 which had been 

digested with BamHI and Bglll (pSHL72 is functionally 
equivalent to pAG060, described in Colbere-Garapin et 
al, 1981). The resulting plasmid was designated as 
pMON50, as shown on Figure 33. 

20 The cloned Bglll fragment contains a region 

of DNA that acts as a polyadenylation site for the 32S 
RNA transcript. This polyadenylation region was 
removed as follows: pMON50 was digested with Avail 
and an 1100 bp fragment was purified. This fragment 

25 was digested with EcoRI* and EcoRV. The resulting 190 
bp EcoRI -EcoRV fragment was purified and inserted into 
plasmid pBR327, which had been digested with EcoRI and 
EcoRV. The resulting plasmid, pMONSl, contains the 
CaMV 32S promoter on a 190 bp EcoRV-EcoRI fragment, as 

30 shown on Figure 33. 
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To make certain the entire promoter region 
of CaMV(32S) was present in pMON81, a region adjacent 
to the 5' (EcoRV) end of the fragment was inserted, 
into pMON81 in the following way. Plasmid pMON50 
5 prepared from dam- cells was digested with EcoRI and 
Bglll and the resultant 1550 bp fragment was purified 
and digested with Mbol. The resulting 725 bp Mbol 
fragment was purified and inserted into the unique 
Bglll site of plasmid pKC7 (Rao and Rogers, 1979) to 
10 give plasmid pMON125, as shown in Figure 34. The 
sequence of bases adjacent to the two Mbol ends 
regenerates Bglll sites and allows the 725 bp fragment 
to be excised with Bglll. 

To generate a fragment carrying the 32S 
15 promoter, the 725 bp Bglll fragment was purified from 
pMON125 and was subsequently digested with EcoRV and 
Alul to yield a 190 bp fragment. Plasmid pMON81 was 
digested with BamHI, treated with Klenow polymerase 
and digested with EcoRV. The 3.1 kb 
20 EcoRV-BamHI (blunt) fragment was purified, mixed. with 
the 190 bp EcoRV-AluI fragment and treated with DNA 
ligase. Following transformation and selectin of 
ampicillin-resistant cells, plasmid pMON172 was 
obtained which carries the CaMV(32S) promoter sequence 
25 on a 380 bp BamHI -EcoRI fragment, as shown on Figure 

35. This fragment does not carry the poly-adenylation 
region for the 32S RNA. Ligation of the Alul end to 
the filled-in BamHI site regenerates the BamHI site. 

To rearrange the restriction endonuclease 
30 sites adjacent to the CaMV(32S) promoter, the 380 bp 
BamHI-EcoRI fragment was purified from pMON172, 
treated with Klenow polymerase, and inserted into the 
unique Smal site of phage M13 mp8. One recombinant 
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piiage, M12, carried the 380 bp fragment in the 
orientation shown on Figure 36. The replicative form 
DNA- from this phage carries the 32 S promoter fragment 
on an EcoRI ( 5 1 ) -BamHI { 3 ' ) fragment . 

5 Plasmids carrying a chimeric gene (CaMV(32S) 

promoter region-NPT II structural sequence-NOS 3' 
non-translated region) were assembled as follows. The 
380 bp EcoRI -BamHI CaMV (32S) promoter fragment was 
purified from phage M12 RF DNA and mixed with the 1250 

10 bp Bglll-EcoRI NPT II-NOS fragment from pMON75. 
Joining of these two fragments through their 
compatible BamHI and Bglll ends results in a 1.6 kb 
CaMV(32S)-NPT II-NOS chimeric gene. This gene was 
inserted into pMON120 at the EcoRI site in both 

15 orientations. The resultant plasmids, pMON183 and 184, 
appear in Figure 37. 

These plasmids were used to transform 
petunia cells. The transformed cells are capable of 
growth on media containing 100 ug/ml kanamycin. 
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CLAIMS 

1. A chimeric gene expressed in one or more 
types of plant cells, comprising the following 
regions, in sequence: 

a. a promoter region derived from a gene which 
is naturally expressed in a plant cell; 

b. a 5' non-translated region; and, 

c. a structural sequence which is heterologous 
with respect to the promoter region, 

wherein the promoter region causes the transcription 
of the structural sequence in plant cells. 

2. A chimeric gene expressed in one or more 
types of plant cells, comprising the following 
regions , in sequence: 

a. a promoter region derived from a gene 
which is naturally expressed in a plant cell; 

b. a 5* non- translated region; and, 

c. a structural sequence which is heterologous 
with respect to the promoter region, 

wherein the chimeric gene or an ancestor gene thereof 
was created by in vitro ligation of a first nucleic 
acid sequence containing the promoter region to a 
second nucleic acid sequence containing the 
structural sequence. 

3. A chimeric gene expressed in one or more 
types of plant cells, comprising the following 
regions, in sequence: 

a. a promoter region derived from a gene 
which is naturally expressed in a plant cell; 

b. a 5' non- translated region; and, 

c. a structural sequence which is heterologous 
with respect to the promoter region, 

wherein the chimeric gene is expressed in one or more 
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types of plant cells at a level which is sufficient 
to cause a phenotypic alteration in transformed plant 
cells compared to the untransformed plant cells. 

4. A chimeric gene expressed in one or more 
types of plant cells, comprising the following 
regions, in sequence: 

a. a promoter region derived from a gene 
which is naturally expressed in a plant cell; 

b. a 5 1 nOn- translated region; and, 

c. a structural sequence which is heterologous 
with respect to the promoter region. 

wherein the chimeric gene is expressed in one or more 
types of plant cells at a level which is sufficient 
to create a quantity of a resulting polypeptide which 
can be removed from the plant and purified. 

5. A chimeric gene of Claim 1 containing a 

3 ' non-translated region derived from a gene which is 
naturally expressed in a plant cell. 

6. A chimeric gene of Claim 1 containing a 
3 1 non-translated region containing a 
poly-adenylation signal derived from a gene which is 
naturally expressed in a plant cell. 

7. A chimeric gene of Claim 1 wherein the 

5' non-translated region is derived from a gene which 
is naturally expressed in a plant cell. 

8. A chimeric gene of Claim 1 which is 
characterized by the absence of a start codon between 
the promoter region and the start codon of the 
structural sequence. 

9. A chimeric gene of Claim 1 which 
contains a spurious start codon and a stop codon 
between the promoter region and the start codon of 
the structural sequence, wherein the spurious start 
codon and stop codon are in the same reading frame. 
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10. A chimeric gene of Claim 1 wherein the 
promoter region is derived from a gene on a Ti or Ri 
plasmid which naturally occurs in bacteria of the 
genus Agrobacterium . 
5 11. A chimeric gene of Claim 1 wherein the 

promoter region is derived from a gene which 
naturally exists within the genome of an 
untrans formed plant. 

12. A chimeric gene of Claim 1 wherein the 
10 promoter region is derived from a virus that is 

capable of infection one or more types of plant cells. 

13. A chimeric gene of Claim 13 wherein the 
virus is a strain of a cauliflower mosaic virus. 

14. A chimeric gene of Claim 13 which is 

15 expressed in one or more types of plant cells which 
are not normally infected by that strain of virus. 

15. A chimeric gene of Claim 1 wherein the 
structural sequence is derived from a mammalian gene. 

16. A chimeric gene of Claim wherein the 

20 structural sequence is derived from a bacterial gene. 

17. A chimeric gene of Claim 1 wherein the 
structural sequence codes for the translation of a 
polypeptide which renders one or more types of plant 
cells or plants capable of surviving exposure to a 

25 phytotoxic substance at a concentration which is 
toxic to the untransformed plant cells or plants. 

18. A chimeric gene of Claim 17 wherein the 
phytotoxic substance renders untransformed plant 
cells or undifferentiated plant tissue incapable of 

30 being regenerated into a differentiated plant. 

19. A chimeric gene of Claim 17 wherein the 
phytotoxic substance comprises a herbicide. 
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20. A chimeric gene of Claim 1 wherein the 
structural sequence is derived from a gene which 
naturally exists in untransformed plant cells, and 
wherein the structural sequence is placed under the 

5 control of a promoter region derived from a plant 
gene other than the gene from which the structural 
sequence was derived. 

21. A chimeric gene of Claim 1 wherein a 
portion of the "structural sequence is homologous 

10 with respect to the promoter. 

22. A microorganism containing a chimeric 
gene of Claim 1. 

23. A microorganism containing a chimeric 
gene of Claim 2. 

15 24. A microorganism containing a chimeric 

gene of Claim 3. 

25. A microorganism containing a chimeric 
gene of Claim 4. 

26. A culture of microorganisms containing 
20 plasmids having the relevant characteristics of 

plasmids contained in a culture of cells having an 
ATCC accession number selected from the group 
consisting of 39264 and 39265. 

27. A culture of microorganisms having the 

25 relevant characteristics of a culture of cells having 
an ATCC accession number selected from the group 
consisting of 39264 and 39265. 

28. A culture of microorganisms descended 
from a culture of cells having an ATCC accession 

30 number selected from the group consisting of 39264 
and 39265. 

29. A method of evaluating the effect of a 
first segment of DNA upon the expression of a 
structural sequence in a plant cell, comprising: 
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a. ligating the first segment of DNA to a 
second segment of DNA which contains a structural 
sequence which codes for a marker polypeptide; 

b. inserting the ligated DNA segments into 

the genome of untransformed plant cells which were 
deficient in the marker polypeptide prior to the 
insertion of the ligated DNA segments, thereby 
creating transformed plant cells; 

c. culturing the plant cells; and, 

d. determining whether the quantity of the 
marker polypeptide in the transformed plant cells 
is greater than the quantity of marker 
polypeptide in the untransformed plant cells. 

30. A method of Claim 29 wherein the marker 
polypeptide, if present in a plant cell, causes the 
plant cell to be resistant to one or more phytotoxic 
substances . 

31. A chimeric gene capable of expression 

in one or more types of plant cells, comprising a 
first segment of DNA identified by the method of 
Claim 29 and a structural sequence which is 
heterologous with respect to the first segment of DNA. 

32. A microorganism which contains a 
chimeric gene of Claim 31. 
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